FLUORESCENT PROTEIN INDICATORS

This invention was made with Government support under Grant No. NS27177
awarded by the National Institutes of Health. The Government has certain rights in
this invention.



TECHNICAL FIELD OF THE INVENTION 
The invention relates generally to fluorescent proteins and more particularly to
compositions and methods for measuring the response of a sensor polypeptide to an
environmental (e.g., biological, chemical, electrical or physiological) parameter.

BACKGROUND 

Fluorescent Ca2+ indicators such as fura-2, indo-1, fluo-3, and Calcium-Green
have been the mainstay of intracellular Ca2+ measurement and imaging (see, for
example, U.S. Pat. No. 4,603,209 and U.S. Pat. No. 5,049,673). These relatively low
molecular weight indicators can suffer from many technical problems relating to ester
loading, leakage of the dyes from the cell, compartmentation in organelles, and
perturbation of the indicators by cellular constituents. Although the Ca2+-indicating
photoprotein aequorin is targetable, the photoresponse to Ca2+ is low since it is
chemiluminescent. Moreover, aequorins need to incorporate exogenous
coelenterazine.

Many effects of Ca2+ in cells are mediated by Ca2+ binding to calmodulin
(CaM), which causes CaM to bind and activate target proteins or peptide sequences.
Based on the NMR structure of CaM bound to the 26-residue M13 Ca2+-binding
peptide of myosin light-chain kinase, Porumb et al. fused the C-terminus of CaM via
a Gly-Gly spacer to M13. Ca2+ binding switches the resulting hybrid protein
(CaM-M13) from a dumbbell-like extended form to a compact globular form similar to
the CaM-M13 intermolecular complex (see Porumb, T., et al., Prot. Engineering
7:109-115 (1994)).

Measurement of a binding member concentration in vitro or in vivo by
non-invasive techniques can help elucidate the physiological function of the binding
member. This can also aid in identifying changes that occur in a cell or organism in
response to physiological stimuli. For example, cyclic AMP can be detected by
fluorescence resonance energy transfer between separately labeled proteins that
associate with each other but are not covalently attached to each other. See U.S. Pat.
No. 5,439,797.

The Aequorea victoria Green Fluorescent Protein (GFP) is useful as a marker
for gene expression, as a fluorescent tag to aid in visualizing protein trafficking, and
as a component of indicator systems that allow fluorescent sensing of small molecules
and pH. Currently, GFP's use as a biosensor is limited to those systems that use GFP
fusion proteins as partners for fluorescence resonance energy transfer (FRET) or those
that use the subcellular redistribution of GFP fusion proteins as indicators of substrate
concentration or the measurement of pH.

Currently, fluorescent molecules designed to measure interactions of proteins 
rely on cameleon molecules of tandem GFP constructs. In these constructs, 
conformational changes occur and alter the FRET between the GFPs and a ratiometric 
color change is noted. Such cameleon or FRET-sensitive constructs are large 
molecules in which protein conformation influences FRET efficiency of two GFPs of 
different colors. Although insertions into Green Fluorescent Protein have been 
attempted (see Abedi et al., Nucleic Acids Research, 26(2):623-630 (1998)), such
insertions have been made to optimize the presentation of short peptide libraries and
not to present binding molecules or sensor polypeptides. Additionally, such insertions
have been only short insertions of about six amino acids in length. Until now,
however, it has not been possible to make a single GFP molecule's fluorescence
sensitive to a substrate other than hydrogen ions. There currently is a desire for 
smaller constructs useful in measuring interactions of molecules in vitro and in vivo. 

SUMMARY OF THE INVENTION
The inventors have discovered that when a sensor polypeptide is inserted into
an Aequorea-related fluorescent protein (e.g., Green Fluorescent Protein (GFP),
Yellow Fluorescent Protein (YFP) or Cyan Fluorescent Protein (CFP)) to form a
construct, interaction of the sensor polypeptide with a biological, chemical, electrical
or physiological parameter, for example, results in a change in fluorescence of the
fluorescent protein. Such constructs are useful in measuring interactions of a sensor
polypeptide with environmental stimuli in vitro or in vivo, or in measuring
characteristics of a cell (e.g., redox potential, intracellular ion concentration). These
constructs rely on the responsiveness of a sensor polypeptide inserted within a
GFP-sensor-related protein itself to influence the actual fluorescence of the fluorophore
and not the interaction of tandem fluorescent molecules.

Accordingly, the present invention provides an isolated nucleic acid sequence
which encodes a fluorescent indicator or chimeric construct, the indicator having a
sensor polypeptide which is responsive to a chemical, biological, electrical or
physiological parameter, and a fluorescent protein moiety, wherein the sensor
polypeptide is operatively inserted into the fluorescent protein moiety, and wherein
the fluorescence of the fluorescent protein moiety is affected by the responsiveness of
the sensor polypeptide. The fluorescent protein moiety can be any fluorescent
protein, for example, an Aequorea-related fluorescent protein moiety. More
specifically, the Aequorea-related fluorescent protein can be, for example, a GFP,
CFP or YFP moiety. The sensor polypeptide may be any polypeptide moiety, for
example, a moiety that undergoes a conformational change upon interaction with a
molecule, oxidation-reduction, or changes in electrical or chemical potential. The
indicator may further include a linker moiety, linking the N- and C-terminal amino
acids of the sensor polypeptide to the fluorescent protein. The linker may be any
moiety that provides for linking of the sensor polypeptide to the fluorescent protein
moiety such as, for example, a nucleic acid that encodes GGTGEL (SEQ ID NO:1) or
FKTRHN (SEQ ID NO:2). Two or more linker moieties may be attached to two
separate polypeptides that together form a sensor polypeptide. Additionally, the
indicator may have a localization sequence for localizing the indicator, for example,
to a particular organelle of a cell. The sensor polypeptide or linker moiety may be
inserted at numerous sites including, for example, one or more amino acids between
residues 128-148, residues 155-160, residues 168-176 or residues 227-229 of the
fluorescent protein moiety (e.g., GFP). More particularly, Y145 is used for insertion
of the linker or sensor polypeptide.

In another embodiment, the present invention provides a transgenic non-human
animal having a nucleic acid sequence which encodes a fluorescent indicator
or chimeric construct, the indicator having a sensor polypeptide which is responsive
to a chemical, biological, electrical or physiological parameter, and a fluorescent
protein moiety, wherein the sensor polypeptide is operatively inserted into the
fluorescent protein moiety, and wherein the fluorescence of the fluorescent protein
moiety is affected by the responsiveness of the sensor polypeptide.

In yet another embodiment, the present invention provides an expression
vector having expression control sequences operatively linked to a nucleic acid
sequence coding for the expression of a fluorescent indicator, the indicator having a
sensor polypeptide which is responsive to a chemical, biological, electrical or
physiological parameter, and a fluorescent protein moiety, wherein the sensor
polypeptide is operatively inserted into the fluorescent protein moiety, and wherein
the fluorescence of the fluorescent protein moiety is affected by the responsiveness of
the sensor polypeptide.

In another embodiment, the present invention provides a host cell transfected
with an expression vector having an expression control sequence operatively linked to
a sequence coding for the expression of a fluorescent indicator. The host cell can be
any host cell capable of transfection and expression of the indicator such as, for
example, a prokaryote (e.g., E. coli), a eukaryotic cell (e.g., a yeast cell) or a
mammalian cell.

In yet a further embodiment, the present invention provides a fluorescent 
indicator polypeptide, the indicator having a sensor polypeptide which is responsive 
to a chemical, biological, electrical or physiological parameter, and a fluorescent 
protein moiety, wherein the sensor polypeptide is operatively inserted into the 
fluorescent protein moiety, and wherein the fluorescence of the fluorescent protein 
moiety is affected by the responsiveness of the sensor polypeptide.

In another embodiment, the present invention provides a fluorescent indicator
or chimeric construct, the indicator having a sensor polypeptide which is responsive
to a chemical, biological, electrical or physiological parameter, and a fluorescent
protein moiety, wherein the sensor polypeptide is operatively inserted into the
fluorescent protein moiety, and wherein the fluorescence of the fluorescent protein
moiety is affected by the responsiveness of the sensor polypeptide, the responsiveness
resulting in protonation or deprotonation of the chromophore of the fluorescent
protein moiety.

In yet another embodiment, the present invention provides a method for
detecting the presence of an environmental parameter in a sample, by contacting the
sample with a fluorescent indicator or chimeric construct, the indicator having a
sensor polypeptide which is responsive to a chemical, biological, electrical or
physiological parameter, and a fluorescent protein moiety, wherein the sensor
polypeptide is operatively inserted into the fluorescent protein moiety, and wherein
the fluorescence of the fluorescent protein moiety is affected by the responsiveness of
the sensor polypeptide, and detecting a change in fluorescence, wherein a change is
indicative of the presence of a parameter which affects the sensor polypeptide.

In another embodiment, the invention provides an isolated nucleic acid
sequence encoding a circularly permuted fluorescent protein and the polypeptide
encoded thereby, having a linker moiety linking the amino-terminal and
carboxy-terminal amino acids of a fluorescent protein, wherein the amino and carboxy
termini are linked as internal amino acids in the circularly permuted fluorescent protein
moiety; and two terminal ends, wherein the first end is an amino-terminal end and the
second end is a carboxy-terminal end and wherein the amino and carboxy terminal
ends of the circularly permuted fluorescent protein moiety are different from the
amino-terminal and carboxy-terminal amino acids of the fluorescent protein.

In another embodiment, the invention provides an expression vector
comprising expression control sequences operatively linked to a nucleic acid sequence
coding for the expression of a fluorescent indicator, the indicator having a linker
moiety linking the amino-terminal and carboxy-terminal amino acids of a fluorescent
protein, wherein the amino and carboxy termini are linked as internal amino acids in
the circularly permuted fluorescent protein moiety; and two terminal ends, wherein
the first end is an amino-terminal end and the second end is a carboxy-terminal end
and wherein the amino and carboxy terminal ends of the circularly permuted
fluorescent protein moiety are different from the amino-terminal and
carboxy-terminal amino acids of the fluorescent protein. In a further embodiment, the
invention provides a host cell containing the expression vector.

In yet another embodiment, the invention provides a method of producing a
nucleic acid sequence encoding a fluorescent indicator by linking a nucleic acid
sequence encoding a linker moiety to the 5' nucleotide of a polynucleotide encoding a
fluorescent protein, circularizing the polynucleotide with the nucleic acid sequence
encoding the linker sequence, and cleaving the circularized polynucleotide with a
nuclease, wherein cleavage linearizes the circularized polynucleotide.
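
The effect of circularizing and then re-opening a sequence can be illustrated at the
protein level. The following is a minimal Python sketch under stated assumptions, not
the inventors' protocol: the function name circular_permute, the toy protein string
and the chosen new start position are hypothetical, and the sketch operates on an
amino acid string rather than on a polynucleotide. The linker GGTGEL is borrowed from
SEQ ID NO:1 purely as an example.

```python
def circular_permute(protein: str, linker: str, new_start: int) -> str:
    """Join the original C-terminus to the original N-terminus through
    `linker`, then reopen the circle so that residue `new_start`
    (1-based) becomes the new N-terminus."""
    if not 1 <= new_start <= len(protein):
        raise ValueError("new_start must be a 1-based position within the protein")
    cut = new_start - 1
    # Residues new_start..end, then the linker, then residues 1..new_start-1.
    return protein[cut:] + linker + protein[:cut]


toy_protein = "MSKGEELFTG"  # placeholder sequence, for illustration only
print(circular_permute(toy_protein, "GGTGEL", 5))
```

In the permuted string the original termini sit in the interior, joined by the linker,
and the new termini lie at what was an internal position, which is the relationship
the embodiments above describe.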

In yet another embodiment, the invention provides a method of producing a
circularly permuted fluorescent protein by expressing a nucleic acid sequence
encoding a linker moiety linking the amino-terminal and carboxy-terminal amino
acids of a fluorescent protein, wherein the amino and carboxy termini are linked as
internal amino acids in the circularly permuted fluorescent protein moiety; and two
terminal ends, wherein the first end is an amino-terminal end and the second end is a
carboxy-terminal end and wherein the amino and carboxy terminal ends of the
circularly permuted fluorescent protein moiety are different from the amino-terminal
and carboxy-terminal amino acids of the fluorescent protein.

The details of one or more embodiments of the invention are set forth in the
accompanying drawings and the description below. Other features, objects, and
advantages of the invention will be apparent from the description and drawings, and
from the claims.

DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the designs of a calmodulin or a Zif268 insertion into a
fluorescent indicator of the present invention.

FIG. 2A shows pH effects on a calmodulin insertion indicator.

FIG. 2B shows titration curves for a calmodulin insertion indicator.

FIG. 3 further shows the effects of calcium concentration on fluorescence of a
fluorescent indicator of the invention containing a sensor polypeptide of calmodulin.

FIG. 4 shows the effect of Zn concentration on fluorescence of a fluorescent
indicator of the invention containing a sensor polypeptide of inserted Zif polypeptide.

FIG. 5 shows the overall design of a circularly permuted polypeptide. 

FIG. 6 shows the result of random digestion of a circularly permuted nucleic
acid sequence.

FIG. 7 shows insertions into GFP.

FIG. 8 shows a schematic diagram of potential insertional sites and sites for
circular permutations in GFP.

DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates to polynucleotides encoding fluorescent 
indicators, fluorescent indicators having a sensor polypeptide, and methods of 
producing and using the same. 

The inventors have discovered a number of sites in Aequorea-related
fluorescent protein moieties that are tolerant to insertions and rearrangements. The
inventors have additionally discovered that the insertion of sensor polypeptides into
such sites results in polypeptides that are useful in detection of chemical, biological,
electrical or physiological parameters, for example. Such insertion sites include, for
example, one or more amino acids between residues 128-148, residues 155-160,
residues 168-176 or residues 227-229 of the fluorescent protein moiety. Other
positions which may tolerate insertions include, but are not limited to, residues 49-50,
78-79, 116-117, 134-135, 140-141, 157-158, 172-173, 194-195, 189-190 and 213-214
(see Abedi et al., Nucleic Acids Research, 26(2):623-630 (1998), the disclosure of
which is incorporated herein). More specifically, the insertion is at Y145.

Additionally, the inventors have discovered that when a sensor polypeptide is
inserted into an Aequorea-related fluorescent protein (e.g., GFP, YFP or CFP) that
provides a response related to an interaction with a biological, chemical, electrical or
physiological parameter, the responsiveness results in a change in fluorescence of the
fluorescent protein. Such constructs are useful in measuring interactions of a sensor
polypeptide with environmental stimuli in vitro or in vivo. These new constructs
rely on, for example, detectable changes within a GFP-sensor-related protein itself to
influence the actual fluorescence of the fluorophore and not the interaction of tandem
fluorescent molecules. For example, when calmodulin is inserted into YFP at position
Y145, interaction of calmodulin with its ligand (e.g., calcium) results in a 2- to 8-fold
change in the brightness of the fluorescent protein.

The indicators of the present invention are advantageous due to their reduced
size as compared to the FRET-based sensors described above. The reduced size is
important in allowing the indicator to measure chemical, biological, electrical or
physiological interactions with the sensor polypeptide in, for example, subcellular
compartments previously inaccessible to the larger FRET-based sensors. In addition,
the maximal change in fluorescence intensity observed in the present indicators (e.g.,
up to an 8-fold increase) is much larger than that in the cameleons (e.g., FRET-based
sensors), which show only a 2-fold change in yellow to cyan intensity ratio.

Accordingly, the invention provides polynucleotides and nucleic acid
sequences encoding fluorescent indicators having a fluorescent protein moiety and a
sensor polypeptide, or fragments thereof, inserted in operable association into the
fluorescent protein moiety, in which the sensor polypeptide is responsive to an
environmental parameter (e.g., a chemical, a biological, an electrical, or a physiological
parameter). Accordingly, the responsiveness of the sensor polypeptide causes a
change in fluorescence of the fluorescent indicator. The degree of change in the
fluorescence of the indicator is sensitive to pH.

As used herein, "operatively inserted" or "operably inserted" means placed between
two amino acids of a polypeptide or two nucleotides of a nucleic acid sequence.
Accordingly, insertion excludes ligating or attaching a polypeptide to the last terminal
amino acid or nucleotide in a sequence.
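
At the sequence level, this definition can be sketched as follows. The snippet is a
minimal Python illustration, not the inventors' procedure: the function name
insert_between, the toy host string and the placeholder sensor string are
hypothetical, and flanking the sensor with one linker on each side is an assumption
made here. Only the linker sequences GGTGEL and FKTRHN are taken from the text.

```python
def insert_between(host: str, insert: str, after: int) -> str:
    """Place `insert` between residue `after` and residue `after + 1`
    of `host` (1-based numbering).

    The bounds check rejects positions that would merely attach the
    insert to a terminus, mirroring the definition of an operative
    insertion given above."""
    if not 1 <= after < len(host):
        raise ValueError("insertion must fall between two residues of the host")
    return host[:after] + insert + host[after:]


host = "MSKGEELFTG"                          # toy host sequence (placeholder)
sensor = "GGTGEL" + "SENSOR" + "FKTRHN"      # linker + placeholder sensor + linker
print(insert_between(host, sensor, 4))
```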

As used herein, a "detectable change" or "responsiveness" means any
response of a polypeptide to a chemical, biological, electrical, or physiological
parameter or stimulus. A response includes small changes, for example, a shift in the
orientation of an amino acid or peptide fragment of the sensor polypeptide as well as,
for example, a change in the primary, secondary, or tertiary structure of a polypeptide,
including, for example, changes in protonation, electrical and chemical potential
and/or conformation. "Conformation" is the three-dimensional arrangement of the
primary, secondary and tertiary structure of a molecule including side groups in the
molecule; a change in conformation occurs when the three-dimensional structure of a
molecule changes. Examples of conformational changes include a shift from α-helix
to a β-sheet or a shift from β-sheet to an α-helix. It is understood that a detectable
change need not be a conformational change, so long as the fluorescence of the
fluorescent protein moiety is altered.

"Fragments" as used herein are a portion of a naturally occurring sensor
protein which can exist in at least two different states or conformations. Fragments
can have the same or substantially the same amino acid sequence as the naturally
occurring protein. "Substantially the same" means that an amino acid sequence is
largely, but not entirely, the same, but retains a functional activity of the sequence to
which it is related. In general, two amino acid sequences are "substantially the same"
or "substantially homologous" if they are at least 85% identical. Fragments which
have three-dimensional structures different from that of the naturally occurring protein
are also included. The term "responsive" as used herein is intended to encompass any
response of a polypeptide which is related to an interaction of a chemical, biological,
electrical, or physiological parameter with a sensor polypeptide (e.g., conformational
change in a voltage-gated ion channel (e.g., Shaker) in detection of membrane voltage
across a biological membrane; phosphorylation of a hormone receptor resulting in a
conformational change in the receptor upon hormone stimulation).
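
The identity criterion for "substantially the same" sequences can be made concrete
with a short calculation. The text does not say how percent identity is to be
computed, so the following Python sketch rests on assumptions: the two sequences are
taken to be already aligned and of equal length, with no gaps, and the threshold is
taken as 85%. The function names are hypothetical.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions carrying the same residue in two
    pre-aligned, equal-length sequences (no gap handling)."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


def substantially_same(seq_a: str, seq_b: str, threshold: float = 85.0) -> bool:
    """True when identity meets the assumed threshold (default 85%)."""
    return percent_identity(seq_a, seq_b) >= threshold


# Nine of ten positions match, so this toy pair passes the assumed threshold.
print(substantially_same("MSKGEELFTG", "MSKGEELFTA"))
```

Real comparisons would normally start from a proper sequence alignment; the
position-by-position count above is only the final step of such a comparison.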
Examples of sensor polypeptides useful in the present invention include
calmodulin, a calmodulin-related protein moiety, recoverin, a nucleoside diphosphate
or triphosphate binding protein, an inositol-1,4,5-triphosphate receptor, a cyclic
nucleotide receptor, a nitric oxide receptor, a growth factor receptor, a hormone
receptor, a ligand-binding domain of a hormone receptor, a steroid hormone receptor,
a ligand-binding domain of a steroid hormone receptor, a cytokine receptor, a
neurotransmitter receptor, a ligand-gated channel, a voltage-gated
channel, a protein kinase C, a domain of protein kinase C, a cGMP-dependent protein
kinase, an inositol polyphosphate receptor, a phosphate receptor, a carbohydrate
receptor, an SH2 domain, an SH3 domain, a PTB domain, an antibody, an
antigen-binding site from an antibody, a single-chain antibody, a zinc-finger domain, a
protein kinase substrate, a protease substrate, a phosphorylation domain, a redox
sensitive loop, a loop containing at least two cysteines that can form a cyclic disulfide,
and a fluorescent protein moiety.

Where the fluorescent protein contains a second fluorescent protein and a
sensor moiety within the insert, FRET-based techniques may be used to analyze or
detect changes in chemical, biological or electrical parameters. For
example, binding of an analyte such as calcium to a sensor polypeptide such as
calmodulin would change the distance or angular orientation of the two fluorescent
protein chromophores relative to each other and thereby modulate FRET. A circularly
permuted fluorescent protein may be tandemly or insertionally fused via a sensor
moiety to a second fluorescent protein (which itself may optionally be a circular
permutation) so that the FRET between the two fluorescent proteins changes in
response to chemical, biological, or electrical parameters.

Classes of sensor polypeptides which may be used in the compositions and
methods of the invention include, but are not limited to, channel proteins, receptors,
enzymes, and G-proteins.

Channel polypeptides useful with the invention include, but are not limited to,
voltage-gated ion channels including the potassium, sodium, chloride,
G-protein-responsive, and calcium channels. A "channel polypeptide" is typically a
polypeptide embedded in the cell membrane which is part of a structure that
determines what particle sizes and/or charges are allowed to diffuse into the cell.
Channel polypeptides include the "voltage-gated ion channels", which are proteins
embedded in a cell membrane that serve as a crossing point for the regulated transfer
of a specific ion or group of ions across the membrane. Specifically, Shaker
potassium channels or dihydropyridine receptors from skeletal muscle may be
advantageously used in the present invention. Several ion channel polypeptides of
use with the invention are listed in Table 1.



TABLE 1: ION CHANNELS

Gene Product                                                                  Genbank Accession No.
Human voltage-gated chloride ion channel CLCN5                                X91906
Human delayed rectifier potassium channel (Isk) gene                          L33815
Human potassium channel protein (HPCN3) gene                                  M55515
Human potassium channel (HPCN2) (mRNA)                                        M55514
Human potassium channel (HPCN1) (mRNA)                                        M55513
Human gamma subunit of epithelial amiloride-sensitive sodium channel (mRNA)   X87160
Human beta subunit of epithelial amiloride-sensitive sodium channel           X87159

Channels also include those activated by intracellular signals, such as those
where the signal is binding of a ligand such as calcium, cyclic nucleotides,
G-proteins, phosphoinositols, or arachidonic acid, for example, and those where the
signal is a covalent modification such as phosphorylation, enzymatic cleavage,
oxidation/reduction, or acetylation, for example. Channel proteins also include
those activated by extracellular ligands (e.g., ionotropic receptors). These can be
activated by acetylcholine, biogenic amines, amino acids, and ATP, for example.

A "receptor polypeptide" is a polypeptide found on a cell, often on a
membrane, that can combine with a specific type of molecule, e.g., a ligand, which
alters a function of the cell. Receptor polypeptides of use with the invention include,
but are not limited to, the growth factor receptors, hormone receptors, cytokine
receptors, chemokine receptors, neurotransmitter receptors, ligand-gated channels,
and steroid receptors. Specifically, polypeptides encoding insulin-like growth factor,
insulin, somatostatin, glucagon, interleukins (e.g., IL-2), transforming growth factors
(TGF-α, TGF-β), platelet-derived growth factor (PDGF), epidermal growth factor
(EGF), nerve growth factor (NGF), fibroblast growth factor (FGF), interferon-γ
(IFN-γ), and GM-CSF receptors are of use with the invention. Receptors such as those
where binding of ligand is transmitted to a G-protein (e.g., for 7-transmembrane
receptors) or kinase domains (for single transmembrane receptors) can be used with
the invention. These can be activated by acetylcholine, biogenic amines, amino acids,
ATP, and many peptides, such as opioids, hypothalamic-releasing hormones,
neurohypophyseal hormones, pituitary hormones, tachykinins, secretins, insulins,
somatostatins, and gastrointestinal peptides. Several receptor polypeptides of use with
the invention are listed in Table 2.

TABLE 2: RECEPTORS

Gene Product                          Genbank Accession No.
Human insulin receptor gene           M29929
Human somatostatin receptor gene      L14856
Human IL-2 receptor gene              X01057, X01058, X01402
Human TGF receptor (mRNA)             M8509
Human PDGF receptor (mRNA)            M22734
Human EGF receptor gene               X06370
Human NGF receptor (mRNA)             M14764
Human FGF receptor (mRNA)             M34641
Human GM-CSF receptor (mRNA)          M73832
Human IFN-γ receptor (mRNA)           X62468


An "enzyme" is a polypeptide that acts as a catalyst, which speeds the rate at
which biochemical reactions proceed but does not alter the direction or nature of the
reaction. Enzyme polypeptides useful in the invention include, but are not limited to,
protein kinases, catalases, amidases, phosphatases, guanylyl and adenylyl cyclases, and
lipoxygenases. Polypeptides encoding the serine/threonine protein kinases are of use
with the invention. Several genes encoding human enzymes of use with the invention
are listed in Table 3.

TABLE 3: ENZYMES

Gene Product                                                      Genbank Accession No.
Human cAMP dependent protein kinase AKAP 79 (mRNA)                M90359
Human protein kinase C beta gene                                  D10022
Human lipid-activated protein kinase PRK-1 (mRNA)                 U33053
Human guanine nucleotide binding protein alpha subunit gene       M21142, J03647, M21139
Human serine/threonine kinase (mRNA)                              M83780

The responsiveness of the sensor polypeptide (e.g., a change in conformation
or state) that occurs in response to interaction of a sensor polypeptide with a chemical,
biological, electrical or physiological parameter will, as discovered by the inventors,
cause a change in fluorescence of the fluorescent indicator. The change can be the
result of an alteration in the environment, structure, protonation or oligomerization
status of the fluorescent indicator or chromophore. The molecular component
responsible for a conformational change is known for many enzymes (e.g., Blostein,
R., et al., (1997) J. Biol. Chem., 272:24987-93; Shoelson, S.E., et al., (1993) EMBO
J., 12:795-802), receptors (e.g., Moyle, W.R., et al., (1995) J. Biol. Chem.,
270:20020-20031; Baron, V., et al., (1992) J. Biol. Chem., 267:23290-23294), and
channels (e.g., Bouzat, A., et al., (1994) Neuron, 13:1395-1402; Dulhunty, A.M., (1994)
Biochemistry, 33:4072-79) polypeptides. The optical properties (e.g., fluorescence) of
the indicator which can be altered in response to the conformational change in the
sensor polypeptide include, but are not limited to, changes in the excitation or
emission spectrum, quantum yield, extinction coefficient, excited-state lifetime and
degree of self-quenching, for example. The causes of the changes in these parameters
may include, but are not limited to, changes in the environment, changes in the
rotational or vibrational freedom of the sensor, changes in the angle of the sensor with
respect to the exciting light or the optical detector apparatus, changes in the protonation
or deprotonation of amino acids or side groups associated with a chromophore, or
changes in distance or dipole orientation between sensors on associated responsive
polypeptides.

For example, insertion of a peptide or protein in place of tyrosine-145 (Y145)
in mutants of GFP increases the sensitivity of fluorescence to quenching by acidic pH.
When the inserted sensor polypeptide responds to a chemical, biological, electrical, or
physiological parameter and undergoes a detectable change (e.g., a change in
conformation), such interactions change the fluorescence via a shift in the acid
sensitivity. For example, when calmodulin, a calcium sensing protein, replaces
residue Y145 in a yellow mutant of GFP, calcium binding increases the fluorescence
by up to 8-fold, depending on the pH at which the measurement is made. Other sites
for insertion into GFP or GFP mutants are allowable where circular permutation is
tolerated, as discussed more fully below.
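
Why a shift in acid sensitivity makes the fold change depend on the measurement pH
can be seen from a simple titration model. The Python sketch below is illustrative
only and rests on assumptions not stated in the text: fluorescence is taken to be
proportional to the fraction of deprotonated chromophore, the titration is modeled as
a single site obeying the Henderson-Hasselbalch relation, and the two pKa values
(10.0 without ligand, 9.0 with ligand) are hypothetical numbers chosen for the example.

```python
def fraction_deprotonated(ph: float, pka: float) -> float:
    """Fraction of a single titratable site in the deprotonated form."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


def fold_change(ph: float, pka_free: float, pka_bound: float) -> float:
    """Ratio of ligand-bound to ligand-free fluorescence at a given pH,
    assuming fluorescence tracks the deprotonated fraction."""
    return fraction_deprotonated(ph, pka_bound) / fraction_deprotonated(ph, pka_free)


# Hypothetical pKa values; a downward shift of one unit on ligand binding.
for ph in (7.0, 9.0, 11.0):
    print(ph, round(fold_change(ph, pka_free=10.0, pka_bound=9.0), 2))
```

Under this model the fold change is largest well below both pKa values, where it
approaches ten raised to the size of the pKa shift, and it falls toward 1 at high pH,
where the chromophore is largely deprotonated in both states.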

In the fluorescent indicator proteins of the invention, the sensor polypeptide is
operably inserted into an optically active polypeptide (e.g., a fluorescent protein
moiety). A protein-based "optically active polypeptide" is a polypeptide which
contains a means for emitting light. Fluorescence is one optical property of an
optically active polypeptide which can be used as the means of detecting the
responsiveness of the sensor or responsive polypeptide of the fluorescent indicator or
circularly permuted fluorescent proteins of the invention. As used herein, the term
"fluorescent property" refers to the molar extinction coefficient at an appropriate
excitation wavelength, the fluorescence quantum efficiency, the shape of the
excitation spectrum or emission spectrum, the excitation wavelength maximum and
emission wavelength maximum, the ratio of excitation amplitudes at two different
wavelengths, the ratio of emission amplitudes at two different wavelengths, the
excited state lifetime, or the fluorescence anisotropy. A measurable difference in any
one of these properties between the active and inactive states suffices for the utility of
the fluorescent protein substrates of the invention in assays for activity. A measurable
difference can be determined by determining the amount of any quantitative
fluorescent property, e.g., the amount of fluorescence at a particular wavelength, or
the integral of fluorescence over the emission spectrum. Optimally, the protein
substrates are selected to have fluorescent properties that are easily distinguishable in
the un-activated and activated conformational states.
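
As an illustration of the quantitative comparison described above, the following Python sketch integrates fluorescence over an emission spectrum and reports the ratio between two states. It is a minimal sketch only: the function names, the trapezoidal integration, and the spectra are hypothetical illustrations and are not taken from the specification.

```python
from typing import Sequence


def integrated_fluorescence(wavelengths_nm: Sequence[float],
                            intensities: Sequence[float]) -> float:
    """Trapezoidal integral of emission intensity over wavelength."""
    if len(wavelengths_nm) != len(intensities) or len(wavelengths_nm) < 2:
        raise ValueError("need two or more matched wavelength/intensity points")
    total = 0.0
    for i in range(1, len(wavelengths_nm)):
        width = wavelengths_nm[i] - wavelengths_nm[i - 1]
        total += 0.5 * (intensities[i] + intensities[i - 1]) * width
    return total


def fold_change(active: float, inactive: float) -> float:
    """Ratio of a fluorescent property in the activated vs. un-activated state."""
    if inactive <= 0:
        raise ValueError("inactive-state value must be positive")
    return active / inactive


# Hypothetical spectra (arbitrary units), invented for illustration only.
wavelengths = [500.0, 510.0, 520.0, 530.0]
inactive_spectrum = [1.0, 2.0, 2.0, 1.0]
active_spectrum = [4.0, 8.0, 8.0, 4.0]

inactive_area = integrated_fluorescence(wavelengths, inactive_spectrum)
active_area = integrated_fluorescence(wavelengths, active_spectrum)
ratio = fold_change(active_area, inactive_area)
```

The same `fold_change` helper could be applied to a single-wavelength reading instead of the integral; either is one of the "quantitative fluorescent properties" named above.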






One means of measuring fluorescence in a sample uses a fluorimeter. In
general, excitation radiation, from an excitation source having a first wavelength,
passes through excitation optics. The excitation optics cause the excitation radiation
to excite the sample. In response, fluorescent proteins in the sample emit radiation
which has a wavelength that is different from the excitation wavelength. Collection
optics then collect the emission from the sample. The device can include a
temperature controller to maintain the sample at a specific temperature while it is
being scanned. For example, a multi-axis translation stage moves a microtiter plate
holding a plurality of samples in order to position different wells to be exposed. The
multi-axis translation stage, temperature controller, auto-focusing feature, and
electronics associated with imaging and data collection can be managed by an
appropriately programmed digital computer. The computer also can transform the
data collected during the assay into another format for presentation. Other means of
measuring fluorescence can also be used with the invention.

Methods of performing assays on fluorescent materials are well known in the
art and are described in, e.g., Lakowicz, J.R., Principles of Fluorescence
Spectroscopy, New York: Plenum Press (1983); Herman, B., Resonance energy
transfer microscopy, in: Fluorescence Microscopy of Living Cells in Culture, Part B,
Methods in Cell Biology, vol. 30, ed. Taylor, D.L. & Wang, Y.-L., San Diego:
Academic Press (1989), pp. 219-243; Turro, N.J., Modern Molecular Photochemistry,
Menlo Park: Benjamin/Cummings Publishing Co., Inc. (1978), pp. 296-361.

Any fluorescent protein can be used in the invention, including proteins that
fluoresce due to intramolecular rearrangements or the addition of cofactors that
promote fluorescence. For example, green fluorescent proteins of cnidarians, which
act as their energy-transfer acceptors in bioluminescence, are suitable fluorescent
proteins for use in the fluorescent indicators. A green fluorescent protein ("GFP") is a
protein that emits green light, a blue fluorescent protein ("BFP") is a protein that
emits blue light, a yellow fluorescent protein ("YFP") is one that emits yellow light,
and a cyan fluorescent protein ("CFP") is one that emits a greenish-blue light. GFPs
have been isolated from the Pacific Northwest jellyfish, Aequorea victoria, the sea
pansy, Renilla reniformis, and Phialidium gregarium. See Ward, W.W., et al.,
Photochem. Photobiol., 35:803-808 (1982); and Levine, L.D., et al., Comp. Biochem.
Physiol., 72B:77-85 (1982).
A variety of Aequorea-related GFPs having useful excitation and emission
spectra have been engineered by modifying the amino acid sequence of a naturally
occurring GFP from Aequorea victoria. (See, Prasher, D.C., et al., Gene, 111:229-
233 (1992); Heim, R., et al., Proc. Natl. Acad. Sci., USA, 91:12501-04 (1994); U.S.
Patent No. 5,491,084; 5,625,048; International application PCT/US95/14692, filed
11/10/95). The cDNA of GFP can be concatenated with those encoding many other
proteins; the resulting chimerics often are fluorescent and retain the biochemical
features of the partner proteins. (See, Cubitt, A.B., et al., Trends Biochem. Sci.
20:448-455 (1995)). Mutagenesis studies have produced many GFP mutants, some
having shifted wavelengths of excitation or emission (see, Heim, R. & Tsien, R.Y.
Current Biol. 6:178-182 (1996)). Suitable pairs, for example a blue-shifted GFP
mutant P4-3 (Y66H/Y145F) and an improved green mutant S65T, can respectively
serve as a donor and an acceptor for fluorescence resonance energy transfer (FRET).
See, Tsien, R.Y., et al., Trends Cell Biol. 3:242-245 (1993). Such proteins are
included in the invention sensor. A fluorescent protein is an "Aequorea-related
fluorescent protein" if any contiguous sequence of 150 amino acids of the fluorescent
protein has at least 85% sequence identity with an amino acid sequence, either
contiguous or non-contiguous, from the wild type Aequorea green fluorescent protein.
More preferably, a fluorescent protein is an Aequorea-related fluorescent protein if
any contiguous sequence of 200 amino acids of the fluorescent protein has at least
95% sequence identity with an amino acid sequence, either contiguous or
non-contiguous, from the wild type Aequorea green fluorescent protein. For example,
where circularly permuted fluorescent proteins are concerned, any continuous
sequence of the circularly permuted sequence which has identity to an
Aequorea-related fluorescent protein, as described above, whether further N- or C-terminal than
the comparison sequence, is considered related. Similarly, the fluorescent protein can
be related to Renilla or Phialidium wild-type fluorescent proteins using the same
standards. Some Aequorea-related engineered versions are described in Table 4. Other
variants or mutants are within the scope of the invention as described, for example, in
the Examples.
TABLE 4

Clone     | Mutation(s)                                      | Excitation max (nm) | Emission max (nm) | Extinction coefficient (M^-1 cm^-1) | Quantum yield
Wild type | none                                             | 395 (475)           | 508               | 21,000 (7,150)                      | 0.77
P4        | Y66H                                             | 383                 | 447               | 13,500                              | 0.21
P4-3      | Y66H; Y145F                                      | 381                 | 445               | 14,000                              | 0.38
W7        | Y66W; N146I; M153T; V163A; N212K                 | 433 (453)           | 475 (501)         | 18,000 (17,100)                     | 0.67
W2        | Y66W; I123V; Y145H; H148R; M153T; V163A; N212K   | 432 (453)           | 480               | 10,000 (9,600)                      | 0.72
S65T      | S65T                                             | 489                 | 511               | 39,200                              | 0.68
P4-1      | S65T; M153A; K238E                               | 504 (396)           | 514               | 14,500 (8,600)                      | 0.53
S65A      | S65A                                             | 471                 | 504               |                                     |
S65C      | S65C                                             | 479                 | 507               |                                     |
S65L      | S65L                                             | 484                 | 510               |                                     |
Y66F      | Y66F                                             | 360                 | 442               |                                     |
Y66W      | Y66W                                             | 458                 | 480               |                                     |
10c       | S65G; V68L; S72A; T203Y                          | 513                 | 527               |                                     |
W1B       | F64L; S65T; Y66W; N146I; M153T; V163A; N212K     | 432 (453)           | 476 (503)         |                                     |
Emerald   | S65T; S72A; N149K; M153T; I167T                  | 487                 | 508               |                                     |
Sapphire  | S72A; Y145F; T203I                               | 395                 | 511               |                                     |
          | T203I                                            | 395                 | 511               |                                     |



An additional clone, W1B1, included the following mutations: F64L; S65T;
Y66W; F99S; and V163A.
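
The window-based identity criterion used above to define an Aequorea-related fluorescent protein can be sketched in Python as follows. This is a simplified sketch under stated assumptions: it assumes the two sequences have already been aligned without gaps and are of equal length, whereas the definition in the text also allows non-contiguous matches and would in practice call for an alignment tool. The function names and the toy sequences are hypothetical.

```python
def best_window_identity(candidate: str, reference: str, window: int) -> float:
    """Highest fraction of identical positions over any contiguous window.

    Assumes candidate and reference are pre-aligned, ungapped, equal length.
    """
    if len(candidate) != len(reference):
        raise ValueError("sequences must be pre-aligned to equal length")
    if window <= 0 or window > len(candidate):
        raise ValueError("window must be between 1 and the sequence length")
    matches = [c == r for c, r in zip(candidate, reference)]
    count = sum(matches[:window])
    best = count
    # Slide the window one residue at a time, updating the match count.
    for i in range(window, len(matches)):
        count += matches[i] - matches[i - window]
        best = max(best, count)
    return best / window


def is_related(candidate: str, reference: str,
               window: int = 150, threshold: float = 0.85) -> bool:
    """Apply the 'any contiguous window at or above threshold' test."""
    return best_window_identity(candidate, reference, window) >= threshold


# Toy 10-residue strings (hypothetical, far shorter than a real protein).
toy_reference = "MSKGEELFTG"
one_change = "MSKGAELFTG"
two_changes = "MSKGAELATG"
```

With the defaults, `is_related` applies the 150-residue, 85% test; passing `window=200, threshold=0.95` gives the "more preferably" variant.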



Other fluorescent proteins can be used in the fluorescent indicators, such as,
for example, yellow fluorescent protein from Vibrio fischeri strain Y-1,
Peridinin-chlorophyll a binding protein from the dinoflagellate Symbiodinium sp.,
phycobiliproteins from marine cyanobacteria such as Synechococcus, e.g.,
phycoerythrin and phycocyanin, or oat phytochromes from oat reconstructed with
phycoerythrobilin. These fluorescent proteins have been described in Baldwin, T.O.,
et al., Biochemistry 29:5509-5515 (1990), Morris, B.J., et al., Plant Molecular
Biology, 24:673-677 (1994), and Wilbanks, S.M., et al., J. Biol. Chem. 268:1226-
1235 (1993), and Li et al., Biochemistry 34:7923-7930 (1995), Murphy, J.T., &
Lagarias, J.C., Current Biology 7:870-876 (1997).

The fluorescent indicators can be produced as chimeric proteins by
recombinant DNA technology. Recombinant production of fluorescent proteins
involves expressing nucleic acids having sequences that encode the proteins. Nucleic
acids encoding fluorescent proteins can be obtained by methods known in the art. For
example, a nucleic acid encoding the protein can be isolated by polymerase chain
reaction of cDNA from A. victoria using primers based on the DNA sequence of A.
victoria green fluorescent protein. PCR methods are described in, for example, U.S.
Pat. No. 4,683,195; Mullis, et al., Cold Spring Harbor Symp. Quant. Biol. 51:263
(1987), and Erlich, ed., PCR Technology (Stockton Press, NY, 1989). Mutant
versions of fluorescent proteins can be made by site-specific mutagenesis of other
nucleic acids encoding fluorescent proteins, or by random mutagenesis caused by
increasing the error rate of PCR of the original polynucleotide with 0.1 mM MnCl2
and unbalanced nucleotide concentrations.

In the chimeric proteins of the invention, the sensor polypeptide is operably
inserted into an optically active polypeptide, which responds (e.g., by a conformation
change) to, for example, a cell signaling event. Cell signaling events that occur in vivo
can be of very short duration. The optically active polypeptides of the invention
allow measurement of the optical parameter, such as fluorescence, which is altered in
response to the cell signal, over the same time period that the event actually occurs.
Alternatively, the response can be measured after the event occurs (over a longer time
period) as the response that occurs in an optically active polypeptide can be of a
longer duration than the cell signaling event itself.
In another embodiment, the invention provides isolated nucleic acid sequences
which encode fluorescent indicator polypeptides having operatively inserted therein a
sensor polypeptide, or fragment thereof, which normally exists in one state, e.g.,
conformational shape or charge, prior to an interaction with a chemical, biological,
electrical or physiological parameter, at which time it undergoes a response during or
after the interaction of the chemical, biological, electrical or physiological parameter
with the sensor polypeptide.

"Polynucleotide" or "nucleic acid sequence" refers to a polymeric form of
nucleotides at least 10 bases in length. By "isolated nucleic acid sequence" is meant a
polynucleotide that is no longer immediately contiguous with both of the coding
sequences with which it was immediately contiguous (one on the 5' end and one on
the 3' end) in the naturally occurring genome of the organism from which it is derived.
The term therefore includes, for example, a recombinant DNA which is incorporated
into a vector; into an autonomously replicating plasmid or virus; or into the genomic
DNA of a prokaryotic or eukaryotic cell or organism, or which exists as a separate
molecule (e.g., a cDNA) independent of other sequences. The nucleotides of the
invention can be ribonucleotides, deoxyribonucleotides, or modified forms of either
nucleotide. The term includes single and double stranded forms of DNA.

Nucleic acid sequences which encode a fluorescent indicator of the invention,
wherein the indicator includes a sensor polypeptide, or fragment thereof, which
normally has two or more states or conformational arrangements, and which
undergoes a response during interaction with a chemical, biological, electrical or
physiological parameter, can be operatively linked to expression control sequences.
"Operatively linked" refers to a juxtaposition wherein the components so described
are in a relationship permitting them to function in their intended manner. An
expression control sequence operatively linked to a coding sequence is ligated such
that expression of the coding sequence is achieved under conditions compatible with
the expression control sequences. As used herein, the term "expression control
sequences" refers to nucleic acid sequences that regulate the expression of a nucleic
acid sequence to which it is operatively linked. Expression control sequences are
operatively linked to a nucleic acid sequence when the expression control sequences
control and regulate the transcription and, as appropriate, translation of the nucleic
acid sequence. Thus, expression control sequences can include appropriate
promoters, enhancers, transcription terminators, a start codon (i.e., ATG) in front of a
protein-encoding gene, splicing signals for introns, maintenance of the correct reading
frame of that gene to permit proper translation of the mRNA, and stop codons. The
term "control sequences" is intended to include, at a minimum, components whose
presence can influence expression, and can also include additional components whose
presence is advantageous, for example, leader sequences and chimeric partner
sequences. Expression control sequences can include a promoter.

By "promoter" is meant a minimal sequence sufficient to direct transcription.
Also included in the invention are those promoter elements which are sufficient to
render promoter-dependent gene expression controllable for cell-type specific,
tissue-specific, or inducible by external signals or agents; such elements may be located in
the 5' or 3' regions of the gene. Both constitutive and inducible promoters are
included in the invention (see, e.g., Bitter et al., 1987, Methods in Enzymology
153:516-544). For example, when cloning in bacterial systems, inducible promoters
such as pL of bacteriophage λ, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like
may be used. When cloning in mammalian cell systems, promoters derived from the
genome of mammalian cells (e.g., metallothionein promoter) or from mammalian
viruses (e.g., the retrovirus long terminal repeat; the adenovirus late promoter; the
vaccinia virus 7.5K promoter; CMV promoter) may be used. Promoters produced by
recombinant DNA or synthetic techniques may also be used to provide for
transcription of the nucleic acid sequences of the invention.

The terms "fluorescent protein" and "fluorescent protein moiety" are used
interchangeably and refer to any protein capable of emitting light when excited with
appropriate electromagnetic radiation, and which has an amino acid sequence that is
either natural or engineered and is derived from the amino acid sequence of an
Aequorea-related fluorescent protein. The term "fluorescent indicator" refers to a
fluorescent protein having a sensor polypeptide whose emitted light varies with the
response state or conformation of the sensor polypeptide upon interaction with a
chemical, biological, electrical or physiological parameter. The term also refers to a
fluorescent protein whose amino acid sequence has been circularly permuted. The
fluorescent indicators of the invention are also sensitive to pH in the range of about 5
to about 10. Thus, the invention provides, for example, a functional engineered
fluorescent protein indicator whose amino acid sequence is substantially identical to
the 238 amino acid Aequorea victoria green fluorescence protein (SEQ ID NO:3).

The invention also includes functional polypeptide fragments of a fluorescent
indicator. As used herein, the term "functional polypeptide fragment" refers to a
polypeptide which possesses biological function or activity which is identified
through a defined functional assay. The term "functional fragments of a functional
engineered fluorescent protein" refers to fragments of a functional engineered protein
that retain a function of the engineered fluorescent protein, e.g., the ability to
fluoresce in a manner which is dependent upon interactions of a chemical, biological,
electrical or physiological parameter with a sensor polypeptide over the pH range 5 to
10.



Minor modifications of the functional engineered fluorescent indicator may 
result in proteins which have substantially equivalent activity as compared to the 
unmodified counterpart polypeptide as described herein. Such modifications may be 
deliberate, as by site-directed mutagenesis, or may be spontaneous. All of the 
polypeptides produced by these modifications are included herein as long as 
fluorescence of the engineered protein still exists. 

By "substantially identical" or "substantially homologous" is meant a protein
or polypeptide that retains the activity of a functional engineered fluorescent
indicator, or nucleic acid sequence or polynucleotide encoding the same, and which
exhibits at least 80%, preferably 85%, more preferably 90%, and most preferably 95%
homology to a reference amino acid or nucleic acid sequence. For polypeptides, the
length of comparison sequences will generally be at least 16 amino acids, preferably
at least 20 amino acids, more preferably at least 25 amino acids, and most preferably
35 amino acids. For nucleic acids, the length of comparison sequences will generally
be at least 50 nucleotides, preferably at least 60 nucleotides, more preferably at least
75 nucleotides, and most preferably 110 nucleotides. The reference amino acid
sequence or nucleic acid sequence is considered homologous if the reference amino
acid sequence is 80-95% homologous to any portion of the amino acid or nucleic acid
sequence in question. For example, where a circularly permuted polypeptide
sequence has been generated, the sequence will typically have an amino acid
sequence wherein a carboxy terminal sequence is now more amino terminal than the
original fluorescent protein. In such instances, the circularly permuted sequence is
considered homologous because the carboxy terminal sequence is still present in the
circularly permuted fluorescent protein even though it is now more N-terminal.
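
The rearrangement described above, in which a carboxy terminal segment becomes more amino terminal while remaining present, can be illustrated with a short Python sketch. The function name and the toy sequence are hypothetical; the sketch simply rotates a string and does not model any linker or any particular permutation site from the specification.

```python
def circularly_permute(sequence: str, new_start: int) -> str:
    """Rotate a sequence so that the residue at index new_start (0-based)
    becomes the new N-terminus; the former N-terminal segment follows."""
    if not 0 <= new_start < len(sequence):
        raise ValueError("new_start must index a residue in the sequence")
    return sequence[new_start:] + sequence[:new_start]


# Toy 8-residue string, invented for illustration only.
original = "MSKGEELF"
permuted = circularly_permute(original, 5)  # former C-terminal "ELF" now leads

# Both segments of the original are still present in the permuted sequence,
# which is why a segment-wise comparison can still find the homology.
c_terminal_segment = original[5:]
n_terminal_segment = original[:5]
```

Rotating by the complementary offset (`len(original) - 5`) restores the original order, which makes the operation easy to check.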



By "substantially identical" is meant an amino acid sequence which differs
only by conservative amino acid substitutions, for example, substitution of one amino
acid for another of the same class (e.g., valine for glycine, arginine for lysine, etc.) or
by one or more non-conservative substitutions, deletions, or insertions located at
positions of the amino acid sequence which do not destroy the function of the protein
(e.g., assayed as described herein). Preferably, such a sequence is at least 85%, more
preferably 90%, more preferably 95%, more preferably 98%, and most preferably
99% identical at the amino acid sequence to one of the sequences of EGFP (SEQ ID
NO:4), EYFP (SEQ ID NO:5), ECFP (SEQ ID NO:6), EYFP-V68L/Q69K (SEQ ID
NO:7), YFP H148G (SEQ ID NO:8), or YFP H148Q (SEQ ID NO:9). As discussed
in the previous paragraph and more fully below, circularly permuted sequences fall
within the definition of "substantially identical", for example if one or more amino
acids of a circularly permuted polypeptide sequence is changed as described herein.

Homology is typically measured using sequence analysis software (e.g.,
Sequence Analysis Software Package of the Genetics Computer Group, University of
Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 53705).
Such software matches similar sequences by assigning degrees of homology to
various substitutions, deletions, substitutions, and other modifications. Conservative
substitutions typically include substitutions within the following groups: glycine,
alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid, asparagine,
glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine.
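
The groups listed above can be encoded directly to classify a substitution as conservative or not. The following Python sketch does so using one-letter amino acid codes; the data structure and function name are hypothetical conveniences, and the grouping is only the one stated in the preceding paragraph, not a general scoring matrix.

```python
# Conservative-substitution groups as listed in the text (one-letter codes).
CONSERVATIVE_GROUPS = [
    {"G", "A"},            # glycine, alanine
    {"V", "I", "L"},       # valine, isoleucine, leucine
    {"D", "E", "N", "Q"},  # aspartic acid, glutamic acid, asparagine, glutamine
    {"S", "T"},            # serine, threonine
    {"K", "R"},            # lysine, arginine
    {"F", "Y"},            # phenylalanine, tyrosine
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if both residues fall within the same listed group."""
    return any(original in group and replacement in group
               for group in CONSERVATIVE_GROUPS)
```

For example, `is_conservative("K", "R")` evaluates to true under this grouping, while a phenylalanine-to-lysine change does not.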



WO 00/71565

PCT/US00/13684
In some embodiments, the amino acid sequence of the protein includes one of
the following sets of substitutions in the amino acid sequence of the Aequorea green
fluorescent protein (SEQ ID NO:3): F64L/S65T/H231L, referred to herein as EGFP
(SEQ ID NO:4); S65G/S72A/T203Y/H231L, referred to herein as EYFP (SEQ ID
NO:5); S65G/V68L/Q69K/S72A/T203Y/H231L, referred to herein as
EYFP-V68L/Q69K (SEQ ID NO:7);
K26R/F64L/S65T/Y66W/N146I/M153T/V163A/N164H/H231L, referred to herein as
ECFP (SEQ ID NO:6). The amino acid sequences of EGFP, EYFP, ECFP, and
EYFP-V68L/Q69K are shown in Tables 5-8, respectively. The numbering of the
amino acids conforms to that in native Aequorea GFP. Thus, the first serine is amino
acid number 2 even if a valine (amino acid no. 1a) has been inserted to optimize
ribosome initiation. Thus, F64L corresponds to a substitution of leucine for
phenylalanine in the 64th amino acid following the initiating methionine.
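
The substitution notation used above (original residue, native position, new residue, with sets joined by "/") can be parsed and applied mechanically. The Python sketch below is illustrative only: the function names and the toy sequence are hypothetical, and the sketch assumes a sequence with no inserted residue 1a, so that string index 0 is the initiating methionine (position 1).

```python
import re
from typing import Tuple


def parse_substitution(code: str) -> Tuple[str, int, str]:
    """Split a code such as 'F64L' into (original, position, replacement)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", code)
    if match is None:
        raise ValueError(f"unrecognized substitution code: {code!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def apply_substitutions(sequence: str, spec: str) -> str:
    """Apply a '/'-separated substitution set using 1-based native numbering."""
    residues = list(sequence)
    for code in spec.split("/"):
        original, position, replacement = parse_substitution(code)
        if not 1 <= position <= len(residues):
            raise ValueError(f"{code}: position outside the sequence")
        if residues[position - 1] != original:
            raise ValueError(f"{code}: expected {original} at position {position}")
        residues[position - 1] = replacement
    return "".join(residues)


# Toy 6-residue string (hypothetical), with serine at position 2.
toy = "MSKGEE"
mutant = apply_substitutions(toy, "S2T/E5D")
```

Checking the expected original residue before replacing it catches numbering mistakes, which matters here because an inserted valine at position 1a would shift string indices relative to native numbering.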
Table 5. EGFP Amino Acid Sequence (SEQ ID NO:4)
KQKJs'Gl

KDPNEKRDHMVLLEFVTAAGITLGMDELYK 

Table 6. EYFP Amino Acid Sequence (SEQ ID NO:5)
vivqKr.FFLFTGVVPlLVELDGDVNGHKFSVSCEGEGUATYGKLTLKFICTTG 

ipVPVVPT^ 

Y^TRAEV KF EGDTLVNRJ ELK.GI DFK.EDGN1 LG H KL ^ J 

Skqkng^kvnfkikhnmedgsvqla 

LSKDPNEKRDHMVLLEFVTAAGITLGMDELYK

Table 7. EYFP-V68L/Q69K Amino Acid Sequence (SEQ ID NO:7)
Mv« K ft--»--l FTGVVPlLVELDGDVNGHKfSVSGEGEGDATYGKLTL-ICFICTTG^ 

^PVPWPTLV^ 
™\Vtr AEVKJEGDTLW 

KDPNEKRDHMVLLEFVTAAGITLGMDELYK

Table 8. ECFP Amino Acid Sequence (SEQ ID NO:6) 
uuc , rF r, pTnwPILVELDGDVNGHRFSVSGEGEGDATYGKLTLKFICTTG 

kqkngi^ahfS 

KDPNEKRDHMVLLEFVTAAGITLGMDELYK 

In other embodiments, the amino acid sequence of the protein is based on the
sequence of the wild-type Aequorea green fluorescent protein, but includes the
substitution H148G (SEQ ID NO:8) or H148Q (SEQ ID NO:9). In specific
embodiments, these substitutions can be present along with other substitutions, e.g.,
the proteins can include the substitutions S65G/V68L/S72A/H148G/Q80R/T203Y
(SEQ ID NO:8), which is referred to herein as the "YFP H148G mutant,"
S65G/V68L/S72A/H148Q/Q80R/T203Y, which is referred to herein as the "YFP
H148Q mutant" (SEQ ID NO:9), as well as EYFP-H148G (SEQ ID NO:10) and
EYFP-H148Q (SEQ ID NO:11). The amino acid sequences of these mutants are
shown in Tables 9-12, respectively.




Table 9. Amino Acid Sequence of YFP H148G (SEQ ID NO:8)

Table 10. Amino Acid Sequence of YFP H148Q (SEQ ID NO:9)

Table 11. Amino Acid Sequence of EYFP-H148G (SEQ ID NO:10)

Table 12. Amino Acid Sequence of EYFP-H148Q (SEQ ID NO:11)






In some embodiments, the protein or polypeptide is substantially purified. By
"substantially pure protein or polypeptide" is meant a functional engineered
fluorescent polypeptide which has been separated from components which naturally
accompany it. Typically, the protein or polypeptide is substantially pure when it is at
least 60%, by weight, free from the proteins and naturally-occurring organic
molecules with which it is naturally associated. Preferably, the preparation is at least
75%, more preferably at least 90%, and most preferably at least 99%, by weight, of
the protein. A substantially pure protein may be obtained, for example, by extraction
from a natural source (e.g., a plant cell); by expression of a recombinant nucleic acid
encoding a functional engineered fluorescent protein; or by chemically synthesizing
the protein. Purity can be measured by any appropriate method, e.g., column
chromatography, polyacrylamide gel electrophoresis, or HPLC analysis.

A protein or polypeptide is substantially free of naturally associated
components when it is separated from those contaminants which accompany it in its
natural state. Thus, a protein or polypeptide which is chemically synthesized or
produced in a cellular system different from the cell from which it naturally originates
will be substantially free from its naturally associated components. Accordingly,
substantially pure polypeptides include those derived from eukaryotic organisms but
synthesized in E. coli or other prokaryotes.

The invention also provides polynucleotides encoding the functional
engineered fluorescent protein described herein. These polynucleotides include DNA,
cDNA, and RNA sequences which encode functional engineered fluorescent proteins.
It is understood that all polynucleotides encoding functional engineered fluorescent
proteins are also included herein, as long as they encode a protein or polypeptide
whose fluorescent emission intensity changes as pH varies between 5 and 10. Such
polynucleotides include naturally occurring, synthetic, and intentionally manipulated
polynucleotides. For example, the polynucleotide may be subjected to site-directed
mutagenesis. The polynucleotides of the invention include sequences that are
degenerate as a result of the genetic code. Therefore, all degenerate nucleotide
sequences are included in the invention as long as the amino acid sequence of the
functional engineered fluorescent protein or derivative is functionally unchanged.
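
The degeneracy noted above can be illustrated by translating two different nucleotide strings and observing that they encode the same residues. The Python sketch below uses a deliberately partial codon table (standard genetic code entries for five amino acids only) and two short strings; the function name, the partial table, and the second string are hypothetical illustrations.

```python
# Partial standard genetic code: only the codons needed for this illustration.
CODON_TABLE = {
    "ATG": "M",
    "AGT": "S", "AGC": "S",
    "AAA": "K", "AAG": "K",
    "GGA": "G", "GGC": "G",
    "GAA": "E", "GAG": "E",
}


def translate(dna: str) -> str:
    """Translate an in-frame coding string using the partial table above.

    Raises KeyError for any codon outside the partial table.
    """
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Two different nucleotide strings that differ at every wobble position.
variant_a = "ATGAGTAAAGGAGAAGAA"
variant_b = "ATGAGCAAGGGCGAGGAG"
```

Both `translate(variant_a)` and `translate(variant_b)` give the same six-residue peptide even though the strings differ at five positions, which is the sense in which degenerate sequences leave the encoded protein unchanged.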


Specifically disclosed herein is a polynucleotide sequence encoding a
functional engineered fluorescent protein that includes one of the following sets of
substitutions in the amino acid sequence of the Aequorea green fluorescent protein
(SEQ ID NO:3): S65G/S72A/T203Y/H231L,
S65G/V68L/Q69K/S72A/T203Y/H231L, or
K26R/F64L/S65T/Y66W/N146I/M153T/V163A/N164H/H231L. In specific embodiments, the DNA sequences
encoding EGFP, EYFP, ECFP, EYFP-V68L/Q69K, YFP H148G, and YFP H148Q
are those shown in Tables 13-20 (SEQ ID NOs:12 to 19), respectively.
The nucleic acid encoding functional engineered fluorescent proteins may be
chosen to reflect the codon choice in the native A. victoria coding sequence, or,
alternatively, may be chosen to reflect the optimal codon frequencies used in the
organism in which the proteins are to be expressed. Thus, nucleic acids encoding a
target functional engineered protein to be expressed in a human cell may use a
codon choice that is optimized for mammals, or especially humans.

Table 13. EGFP Nucleic Acid Sequence (SEQ ID NO:12)
CGAGCTGOACGUCGACGlAAACGGCCACAAGTTCAGCGTr.Trrrrrr »r 

5 G £~ A GG ^ c g a t g c c a c ct a cg g c a a gctg ac cctg a a g ttc a tctg^ /[c 

CACCGGCAAGCTGCCCGTGCCCTGG^ 
10 ACGGCCTCCAGTGCTTCAGCCGCTACCrc^^ 

^CTTCAAGTCCGCCATGCCCGAAGGCTACGTC^ 

tt^caaggaggacggcaactacaagacccgcgccgaggtca^gt^ 

GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCG^ 
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAA^G^ 

° a ?? t " a tatcatggccga^ 

AAGA J CCGCCACAACATC GAGGACGGCAGCGTGCAGCTCG^^^ 
,5 C G .SSS^ 



3 



Table 14. EYFP Nucleic Acid Sequence (SEQ ID NO: 13) 

ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCC-VTCrTrxrT 
CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCG^ 
GGCGAGGGCGATGCCACCTACGGCAAGCTGACC^ 
CACCGGCAAGCTCCCCGTGCCCTGG^ 

cggcgtgcagtgcttcgcccgctacc^ 
t^ag^ 

J[CAAGGACGACOGCAAC^ACAAGACCCGCGCCGAGGTG'^AGTTCG^^r,^ 

cgacaccctggtgaaccgcatcgagctgaa 

ACGGCAACATCCTGGGGCACAAGCTGGAG™^ 

cgtctatatcatggccuacaagcagaagaacggc^^^ 
agatccgccacaacatcgaggacggcagcgtgcagctggccgaccS 
Table 15. ECFP Nucleic Acid Sequence (SEQ ID NO:14)

ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT
CGAGCTGGACGGCGACGTAAACGGCCACAGGTTCAGCGTGTCCGGCGAG
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT
GGGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC
TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGTACCATCTTC
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACATCAGCCACAA
CGTCTATATCACCGCCGACAAGCAGAAGAACGGCATCAAGGCCCACTTCA
AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC
CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA
CTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCG
ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC
ATGGACGAGCTGTACAAGTAA

Table 16. EYFP-V68L/Q69K Nucleic Acid Sequence (SEQ ID NO:15)

ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT
CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTCGGCTA
CGGCCTGAAGTGCTTCGCCCGCTACCCCGACCACATGAAGCAGCACGACT
TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCAGCATCTTCT
TCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG
CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG
ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA
CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA
AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC
CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA
CTACCTGAGCTACCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCG
ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC
ATGGACGAGCTGTACAAGTAA



Table 17. Nucleotide Sequence of the YFP H148G Coding Region (SEQ ID NO:16)

ATGAGTAAAGGAGAAGAACTTTTCACTGGAGTTGTCCCAATTCTTGTTGA
ATTAGATGGTGATGTTAATGGGCACAAATTTTCTGTCAGTGGAGAGGGTG
AAGGTGATGCAACATACGGAAAACTTACCCTTAAATTTATTTGCACTACTG
GAAAACTACCTGTTCCATGGCCAACACTTGTCACTACTTTCGGTTATGGTC
TTCAATGCTTTGCAAGATACCCAGATCATATGAAACGGCATGACTTTTTCA
AGAGTGCCATGCCCGAAGGTTATGTTCAGGAAAGAACTATATTTTTCAAA
GATGACGGGAACTACAAGACACGTGCTGAAGTCAAGTTTGAAGGTGATAC
CCTTGTTAATAGAATCGAGTTAAAAGGTATTGATTTTAAAGAAGATGGAA
ACATTCTTGGACACAAATTGGAATACAACTATAACTCAGGCAATGTATAC
Table 18. Nucleotide Sequence of the YFP H148Q Coding Region (SEQ ID NO:17)
AGAGTGCCATGCCCG^ 

GATGACGGGAACTACAAGAC^cpTfV^T^^^^^ A ^ A ^^^^ AA *^ 
CCTTGTTAATAGA ATrr ^7-™?? T0 . GTGAAGT CAAGTTTGAAGGTGATAC 

Table 19. Nucleotide Sequence of the EYFP-H148G Coding Region (SEQ ID NO:18)

ggcgagggcgatccc.a?ctacgS?a^^ 
Table 20. Nucleotide Sequence of the EYFP-H148Q Coding Region (SEQ ID NO:19)

ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT
CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTCGGCTA
CGGCGTGCAGTGCTTCGCCCGCTACCCCGACCACATGAAGCAGCACGACT
TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCT
TCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG
CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG
ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCAGAA
CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA
AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC
CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA
CTACCTGAGCTACCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCG
ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC
ATGGACGAGCTGTACAAGTAA

The functional engineered fluorescent protein can also include a targeting
sequence to direct the fluorescent protein to particular cellular sites by fusion to
appropriate organellar targeting signals or localized host proteins. A polynucleotide
encoding a targeting sequence can be ligated to the 5' terminus of a polynucleotide
encoding the fluorescent protein such that the targeting peptide is located at the amino
terminal end of the resulting fusion polynucleotide/polypeptide. The targeting
sequence can be, e.g., a signal peptide. In the case of eukaryotes, the signal peptide is
believed to function to transport the fusion polypeptide across the endoplasmic
reticulum. The secretory protein is then transported through the Golgi apparatus, into
secretory vesicles and into the extracellular space or, preferably, the external
environment. Signal peptides which can be utilized according to the invention
include pre-pro peptides which contain a proteolytic enzyme recognition site. Other
signal peptides with similar properties are known to those skilled in the art, or can be
readily ascertained without undue experimentation.



In the present invention, the nucleic acid sequences encoding the fluorescent
indicator or circularly permuted fluorescent protein of the invention may be inserted
into a recombinant expression vector. The term "recombinant expression vector"
refers to a plasmid, virus or other vehicle known in the art that has been manipulated
by insertion or incorporation of the nucleic acid sequences encoding the chimeric
peptides of the invention. The expression vector typically contains an origin of
replication, a promoter, as well as specific genes which allow phenotypic selection of
the transformed cells. Vectors suitable for use in the present invention include, but
are not limited to, the T7-based expression vector for expression in bacteria
(Rosenberg, et al., Gene 56:125, 1987), the pMSXND expression vector, or adeno or
vaccinia viral vectors for expression in mammalian cells (Lee and Nathans, J. Biol.
Chem., 263:3521, 1988), baculovirus-derived vectors for expression in insect cells,
cauliflower mosaic virus, CaMV, tobacco mosaic virus, TMV.

The nucleic acid sequences encoding a fluorescent indicator or circularly
permuted fluorescent protein of the invention can also include a localization sequence
to direct the indicator to particular cellular sites by fusion to appropriate organellar
targeting signals or localized host proteins. A polynucleotide encoding a localization
sequence, or signal sequence, can be ligated or fused at the 5' terminus of a
polynucleotide encoding the fluorescent indicator such that the signal peptide is
located at the amino terminal end of the resulting chimeric
polynucleotide/polypeptide. In the case of eukaryotes, the signal peptide is believed
to function to transport the chimeric polypeptide across the endoplasmic reticulum.
The secretory protein is then transported through the Golgi apparatus, into secretory
vesicles and into the extracellular space or, preferably, the external environment.
Signal peptides which can be utilized according to the invention include pre-pro
peptides which contain a proteolytic enzyme recognition site. Other signal peptides
with similar properties to those described herein are known to those skilled in the art,
or can be readily ascertained without undue experimentation.

The localization sequence can be a nuclear localization sequence, an
endoplasmic reticulum localization sequence, a peroxisome localization sequence, a
mitochondrial localization sequence, or a localized protein. Localization sequences
can be targeting sequences which are described, for example, in "Protein Targeting",
Chapter 35 of Stryer, L., Biochemistry (4th ed.), W.H. Freeman, 1995. The
localization sequence can also be a localized protein. Some important localization
sequences include those targeting the nucleus (KKKRK) (SEQ ID NO:20),
mitochondrion (amino terminal MLRTSSLFTRRVQPSLFRNILRLQST (SEQ ID
NO:21)), endoplasmic reticulum (KDEL (SEQ ID NO:22) at C-terminus, assuming a
signal sequence present at N-terminus), peroxisome (SKF at C-terminus), synapses
(S/TDV or fusion to GAP 43, kinesin and tau), prenylation or insertion into plasma
membrane (CaaX (SEQ ID NO:23), CC, CXC, or CCXX (SEQ ID NO:24) at C-
terminus), cytoplasmic side of plasma membrane (chimeric to SNAP-25), or the Golgi
apparatus (chimeric to furin).

The construction of expression vectors and the expression of genes in
transfected cells involves the use of molecular cloning techniques also well known in
the art. See Sambrook et al., Molecular Cloning -- A Laboratory Manual, Cold Spring
Harbor Laboratory, Cold Spring Harbor, NY, (1989) and Current Protocols in
Molecular Biology, F.M. Ausubel et al., eds., (Current Protocols, a joint venture
between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., most recent
Supplement). These methods include in vitro recombinant DNA techniques, synthetic
techniques and in vivo recombination/genetic recombination. (See, for example, the
techniques described in Maniatis, et al., Molecular Cloning A Laboratory Manual,
Cold Spring Harbor Laboratory, N.Y., 1989).
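
As a rough illustration of how the terminal localization motifs listed above might be attached to an indicator sequence in silico, the following Python sketch appends or prepends a motif according to the terminus stated in the text. The dictionary layout, the function name and the toy protein string are assumptions made for this sketch only; they are not part of the described constructs, and only motifs whose terminus is stated above are included.

```python
# Sketch only: motifs and termini are taken from the list above; the data
# structure, function name and toy sequence are hypothetical.
LOCALIZATION_TAGS = {
    # compartment: (terminus, motif)
    "mitochondrion": ("N", "MLRTSSLFTRRVQPSLFRNILRLQST"),  # SEQ ID NO:21
    "endoplasmic_reticulum": ("C", "KDEL"),  # also assumes an N-terminal signal sequence
    "peroxisome": ("C", "SKF"),
}


def add_localization_tag(protein: str, compartment: str) -> str:
    """Return the protein sequence with the motif fused at the stated terminus."""
    terminus, motif = LOCALIZATION_TAGS[compartment]
    if terminus == "N":
        return motif + protein
    return protein + motif


if __name__ == "__main__":
    toy_indicator = "MSKGEELFTG"  # placeholder, not a real indicator sequence
    print(add_localization_tag(toy_indicator, "endoplasmic_reticulum"))
```

For motifs whose position is not a simple terminal fusion (for example, chimeras with SNAP-25 or furin), a lookup of this kind would not be sufficient.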

Examples of agents which induce a sensor polypeptide include agents that
contain any of the amino acid sequences in Table 2, or a portion thereof, with the
proviso that the parameter must bind to a calmodulin sensor polypeptide. The
parameter can be a subsequence of a calmodulin-binding domain. The moieties listed
in Table 2 are recognized by the sensor polypeptide CaM. See, for example, Crivici,
A. & Ikura, M. Annu. Rev. Biophys. Biomol. Struct. 24:84-116 (1995). The
parameter can be modified to enhance the response of the fluorescent indicator to the
parameter. Other parameters are known in the art for other sensor polypeptides.

TABLE 2

Target                    Sequence

skMLCK (M13)              KRRWKKNFIAVSAANRFKKISSSGAL (SEQ ID NO:25)
smMLCK (smMLCKp)          ARRKWQKTGHAVRAIGRLSS (SEQ ID NO:26)
CaMKII                    ARRKLKGAILTTMLATRNFS (SEQ ID NO:27)
Caldesmon                 GVRNIKSMWEKGNVFSS (SEQ ID NO:28)
Calspermin                ARRKLKAAVKAVVASSRLGS (SEQ ID NO:29)
PFK (M11)                 FMNMWEVYKLLAHIRPPAPKSGSYTV (SEQ ID NO:30)
(PhK13)                   ...
Ca2+-ATPase (C28W)        ...
Bordetella pertussis AC   ...
Neuromodulin              ...
Glucagon                  ...
Secretin                  ... (SEQ ID NO:49)
VIP                       ... (SEQ ID NO:51)
GIP                       ... (SEQ ID NO:53)
...                       ... (SEQ ID NO:54)

Abbreviations: AC, adenylyl cyclase; BBMHCI, brush-border myosin heavy chain-I;
CaMKII, calmodulin kinase II; CBP2, calmodulin binding peptide-2; GIP, gastrin
inhibitory peptide; HIV-1 gp160, human immunodeficiency virus envelope
glycoprotein 160; HSP, heat-shock protein; MARCKS, myristoylated alanine-rich C
kinase substrate; MHC, myosin heavy chain; NOS, nitric oxide synthase; PDE,
phosphodiesterase; PFK, phosphofructokinase; PhK, phosphorylase kinase; sk-,
smMLCK, skeletal muscle- and smooth muscle-myosin light chain kinase; VIP,
vasoactive intestinal peptide.

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Where linker moiety is present, the length of the linker moiety is chosen to 
optimize the kinetics and specificity of responsiveness of the sensor polypeptide 
induced by the interaction of the chemical, biological, electrical or physiological 
parameter with the sensor polypeptide. The linker moiety should be long enough and
flexible enough to allow the sensor polypeptide to freely interact and respond to a
particular parameter. The linker moiety is, preferably, a peptide moiety. The
preferred linker moiety is a peptide between about one and 30 amino acid residues in
length, preferably between about two and 15 amino acid residues. One preferred
linker moiety is a -Gly-Gly- linker. The linker moiety can include flexible spacer
amino acid sequences, such as those known in single-chain antibody research. For
example, the linker moiety can be (GGGGS)n (SEQ ID NO:56),
GKSSGSGSESKS (SEQ ID NO:57), GSTSGSGKSSEGKC (SEQ ID NO:58),
GSTSGSGKSSEGSGSTKG (SEQ ID NO:59), GSTSGSGKSSEGKC (SEQ ID
NO:60), GSTSGSGKPGSGEGSTKG (SEQ ID NO:61), EGKSSGSGSESKEF (SEQ
ID NO:62), GGTGEL (SEQ ID NO:1), FKTRHN (SEQ ID NO:2), or GGTGGS
(SEQ ID NO:63). Linking moieties are described, for example, in Huston, J.S., et al.,
PNAS 85:5879-5883 (1988), Whitlow, M., et al., Protein Engineering 6:989-995
(1993), and Newton, D.L., et al., Biochemistry 35:545-553 (1996).
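
The length guidance above (about one to 30 residues, preferably about two to 15) can be checked mechanically when candidate linkers are enumerated. The Python sketch below builds a repeating (GGGGS)n linker and tests its length against those ranges. The function names and the choice of n are illustrative assumptions, and length alone says nothing about flexibility or about how a given sensor polypeptide will respond.

```python
# Sketch only: helper names are hypothetical; the length limits come from the
# preferred ranges stated in the paragraph above.
def repeat_linker(unit: str = "GGGGS", n: int = 3) -> str:
    """Build a linker made of n copies of a flexible spacer unit."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return unit * n


def within_range(linker: str, shortest: int, longest: int) -> bool:
    """True if the linker length falls inside the inclusive range."""
    return shortest <= len(linker) <= longest


if __name__ == "__main__":
    for n in (1, 3, 4):
        linker = repeat_linker(n=n)
        print(n, linker, len(linker),
              "general:", within_range(linker, 1, 30),
              "preferred:", within_range(linker, 2, 15))
```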

Depending on the vector utilized, any of a number of suitable transcription and
translation elements, including constitutive and inducible promoters, transcription
enhancer elements, transcription terminators, etc. may be used in the expression
vector (see, e.g., Bitter, et al., Methods in Enzymology 153:516-544, 1987). These
elements are well known to one of skill in the art.

In bacterial systems a number of expression vectors may be advantageously
selected depending upon the intended use. For example, when large quantities of a
protein of the invention are desired, vectors which direct the expression of high levels
of chimeric protein products that are readily purified may be desirable. Those which
are engineered to contain a cleavage site to aid in protein recovery are preferred.


In yeast, a number of vectors containing constitutive or inducible promoters
may be used. For a review see, Current Protocols in Molecular Biology, Vol. 2, Ed. ...

... invention may be cloned into non-essential regions ... of the virus and placed
under control of an AcNPV promoter ... Successful insertion of the sequences ...
occluded recombinant virus (i.e., virus lacking the proteinaceous coat coded for by
the ... 46:584, 1983; Smith, U.S. Patent No. 4,215,051.

Another alternative expression system ... plant cell ... with recombinant plasmid
expression vectors (e.g., Ti plasmid) containing a fluorescent indicator or circularly
permuted fluorescent protein.

By "transformation" is meant a permanent or transient genetic change induced
in a cell following incorporation of new DNA (i.e., DNA exogenous to the cell).
Where the cell is a mammalian cell, a permanent genetic change is generally achieved
by introduction of the DNA into the genome of the cell.

By "transformed cell" is meant a cell into which (or into an ancestor of which)
has been introduced, by means of recombinant DNA techniques, a DNA molecule
encoding a fluorescent indicator or circularly permuted fluorescent protein having an
optically active polypeptide having operatively inserted therein a sensor polypeptide,
or fragment thereof, which normally has two or more states, and which is affected by
a chemical, biological, electrical or physiological parameter.

Transformation of a host cell with recombinant DNA may be carried out by
conventional techniques as are well known to those skilled in the art. Where the host
is prokaryotic, such as E. coli, competent cells which are capable of DNA uptake can
be prepared from cells harvested after exponential growth phase and subsequently
treated by the CaCl2 method by procedures well known in the art. Alternatively,
MgCl2 or RbCl can be used. Transformation can also be performed after forming a
protoplast of the host cell or by electroporation.

When the host is a eukaryote, such methods of transfection of DNA as calcium
phosphate co-precipitates, conventional mechanical procedures such as
microinjection, electroporation, insertion of a plasmid encased in liposomes, or virus
vectors may be used. Eukaryotic cells can also be cotransfected with DNA sequences
encoding the chimeric polypeptide of the invention, and a second foreign DNA
molecule encoding a selectable phenotype, such as the herpes simplex thymidine
kinase gene. Another method is to use a eukaryotic viral vector, such as simian virus
40 (SV40), adenovirus, vaccinia virus, or bovine papilloma virus, to transiently infect
or transform eukaryotic cells and express the protein. (Eukaryotic Viral Vectors, Cold
Spring Harbor Laboratory, Gluzman ed., 1982). Preferably, a eukaryotic host is
utilized as the host cell as described herein. Methods of stable transfer, meaning that
the foreign DNA is continuously maintained in the host, are known in the art.



Eukaryotic systems, and preferably mammalian expression systems, allow for
proper post-translational modifications of expressed mammalian proteins to occur.
Eukaryotic cells which possess the cellular machinery for proper processing of the
primary transcript, glycosylation, phosphorylation, and, advantageously, secretion of
the gene product should be used as host cells for the expression of fluorescent
indicator. Such host cell lines may include but are not limited to CHO, VERO, BHK,
HeLa, COS, MDCK, Jurkat, HEK-293, and WI38.

Mammalian cell systems which utilize recombinant viruses or viral elements to
direct expression may be engineered. For example, when using adenovirus
expression vectors, the nucleic acid sequences encoding a fluorescent indicator or
circularly permuted fluorescent protein of the invention may be ligated to an
adenovirus transcription/translation control complex, e.g., the late promoter and
tripartite leader sequence. This nucleic acid sequence may then be inserted in the
adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential
region of the viral genome ... will result in a recombinant virus that is viable and
capable of expressing the fluorescent indicator in infected hosts (e.g., see Logan &
Shenk, Proc. Natl. Acad. Sci. USA, 81:3655-3659, 1984). Alternatively, the vaccinia
virus 7.5K promoter may be used (see, e.g., Mackett, et al. ...). Of particular interest
are vectors based on bovine papilloma virus which have the ability to replicate as
extrachromosomal elements (Sarver, et al., Mol. Cell. Biol. ..., 1981). Shortly after
entry of this DNA into mouse cells, the plasmid replicates to about 100 to 200 copies
per cell. Transcription of the inserted cDNA does not require integration of the
plasmid into the host's chromosome, thereby yielding a high level of expression.
These vectors can be used for stable expression by including a selectable marker in
the plasmid, such as the neo gene. Alternatively, the retroviral genome can be
modified for use as a vector capable of introducing and directing the expression of the
fluorescent indicator gene in host cells (Cone & Mulligan, Proc. Natl. Acad. Sci.
USA, 81:6349-6353, 1984). High level expression may also be achieved using
inducible promoters, including, but not limited to, the metallothionine IIA promoter
and heat shock promoters.
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For long-term, high-yield production of recombinant proteins, stable 
expression is preferred. Rather than using expression vectors which contain viral 
origins of replication, host cells can be transformed with the cDNA encoding a
fluorescent indicator or circularly permuted fluorescent protein of the invention
controlled by appropriate expression control elements (e.g., promoter, enhancer
sequences, transcription terminators, polyadenylation sites, etc.), and a selectable
marker. The selectable marker in the recombinant plasmid confers resistance to the
selection and allows cells to stably integrate the plasmid into their chromosomes and
grow to form foci which in turn can be cloned and expanded into cell lines. For
example, following the introduction of foreign DNA, engineered cells may be allowed
to grow for 1-2 days in an enriched medium, and then are switched to a selective
medium. A number of selection systems may be used, including but not limited to the
herpes simplex virus thymidine kinase (Wigler, et al., Cell, 11:223, 1977),
hypoxanthine-guanine phosphoribosyltransferase (Szybalska & Szybalski, Proc. Natl.
Acad. Sci. USA, 48:2026, 1962), and adenine phosphoribosyltransferase (Lowy, et al.,
Cell, 22:817, 1980) genes, which can be employed in tk-, hgprt- or aprt- cells,
respectively. Also, antimetabolite resistance can be used as the basis of selection for
dhfr, which confers resistance to methotrexate (Wigler, et al., Proc. Natl. Acad. Sci.
USA, 77:3567, 1980; O'Hare, et al., Proc. Natl. Acad. Sci. USA, 78:1527, 1981); gpt,
which confers resistance to mycophenolic acid (Mulligan & Berg, Proc. Natl. Acad.
Sci. USA, 78:2072, 1981); neo, which confers resistance to the aminoglycoside G-418
(Colberre-Garapin, et al., J. Mol. Biol., 150:1, 1981); and hygro, which confers
resistance to hygromycin (Santerre, et al., Gene, 30:147, 1984) genes. Recently,
additional selectable genes have been described, namely trpB, which allows cells to
utilize indole in place of tryptophan; hisD, which allows cells to utilize histidinol in
place of histidine (Hartman & Mulligan, Proc. Natl. Acad. Sci. USA, 85:8047, 1988);
and ODC (ornithine decarboxylase), which confers resistance to the ornithine
decarboxylase inhibitor, 2-(difluoromethyl)-DL-ornithine, DFMO (McConlogue L.,
In: Current Communications in Molecular Biology, Cold Spring Harbor Laboratory,
ed., 1987).

A fluorescent indicator or circularly permuted fluorescent protein of the
invention can be produced by expression of nucleic acid encoding the protein in
prokaryotes. These include but are not limited to microorganisms such as bacteria
transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA ...

... advantage of the optically active polypeptides of the invention is that they are
prepared by normal protein biosynthesis, thus avoiding organic synthesis and the
requirement for customized unnatural amino acid analogs. The constructs can be
expressed in E. coli in large scale for in vitro assays. Purification from bacteria is
simplified when the sequences include ... chromatography. The construct can also
contain a tag to simplify isolation of the fluorescent indicator. For example, a
polyhistidine tag of, e.g., six histidine residues, can be incorporated at the amino
terminal end of the ... The polyhistidine tag allows convenient isolation of the protein
in a single step by nickel-chelate chromatography. Alternatively, the substrates can be
expressed directly in a desired host cell for assays in situ.

Techniques for the isolation and purification of either microbially or
eukaryotically expressed polypeptides of the invention may be by any conventional
means such as, for example, preparative chromatographic separations and
immunological separations such as those involving the use of monoclonal or
polyclonal antibodies or antigen.

It should be understood that a nucleic acid sequence can also function as the
concentration or available parameter in another embodiment of the invention. For
example, a response may result from the interaction of a nucleic acid sequence with a
sensor polypeptide comprising a DNA binding protein motif.

Screening Assays

The invention features a method for determining the presence of a chemical,
biological, electrical or physiological parameter, by contacting the sample with a
fluorescent indicator or circularly permuted fluorescent protein of the invention;
exciting the indicator or protein; and measuring the amount of an optical property of
the indicator or protein in the presence and absence of a parameter, such that a change
in the optical property is indicative of an effect of the parameter on the indicator or
protein. A series of standards, with known levels of activity, can be used to generate
a standard curve. The optical event, such as intensity of fluorescence, that occurs
following exposure of the sample to the fluorescent indicator or protein is measured,
and the amount of the optical property is then compared to the standard curve. A
standard, with a known level of activity, can be used to generate a standard curve, or
to provide reference standards. The optical event, such as fluorescence, that occurs
following exposure of the sample to the fluorescent indicator or protein is measured,
and the amount of the optical property (e.g., fluorescence) is then compared to the
standard in order to generate a relative measure of the effect of the sample on the
fluorescent indicator.
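
The comparison of a measured optical property against a standard curve can be illustrated with a short Python sketch. It assumes, purely for illustration, that fluorescence rises monotonically with activity across the standards and that linear interpolation between neighboring standards is adequate; the function name and all numerical values are hypothetical and do not come from the specification.

```python
# Sketch only: piecewise-linear read-out of an activity level from a standard
# curve. Real curves may be non-linear and may call for a fitted model instead.
def interpolate_activity(standards, fluorescence):
    """standards: iterable of (known_activity, measured_fluorescence) pairs.

    Returns the activity estimated for the given fluorescence reading.
    """
    points = sorted(standards, key=lambda pair: pair[1])
    for (a0, f0), (a1, f1) in zip(points, points[1:]):
        if f0 <= fluorescence <= f1:
            return a0 + (a1 - a0) * (fluorescence - f0) / (f1 - f0)
    raise ValueError("reading lies outside the range covered by the standards")


if __name__ == "__main__":
    curve = [(0.0, 100.0), (10.0, 200.0), (20.0, 400.0)]  # made-up standards
    print(interpolate_activity(curve, 300.0))
```

Readings outside the range of the standards raise an error rather than being extrapolated, which is a deliberately conservative choice for this sketch.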

In another embodiment, the invention features a method for determining if a
cell exhibits an activity, which includes transfecting the cell with a nucleic acid
encoding a fluorescent indicator or circularly permuted fluorescent protein of the
invention; exciting the fluorescent indicator or circularly permuted fluorescent
protein; and measuring the amount of an optical property in the presence of the
activity and in the absence of the activity, such that a change in the optical property is
indicative of activity. Typically, the optical property is calibrated against standard
measurements to yield an absolute amount of protein activity.

The invention additionally features methods for determining transient changes
in a chemical, biological, electrical or physiological parameter, by contacting the
sample with a fluorescent indicator or circularly permuted fluorescent protein of the
invention and measuring a change in the optical property of the indicator over time.

It is understood that the cell containing a nucleic acid sequence encoding a
fluorescent indicator or circularly permuted fluorescent protein of the invention can
be co-transfected with other genes of interest in order to determine the effect of the
gene product of that gene on the cell or the sensor polypeptide of the fluorescent
indicator or circularly permuted fluorescent protein. Therefore, a cell containing such
a nucleic acid sequence is a composition provided by the present invention.
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The invention can be used in screening assays to determine whether a 
compound (e.g., a drug, a chemical or a biologic) alters the activity of a particular
protein, i.e., the sensor polypeptide (e.g., ligand binding to a receptor). In one
embodiment, the assay is performed on a sample containing the chimeric protein in
vitro. A sample containing a known amount of activity, such as an enzymatic
activity, is mixed with a fluorescent indicator substrate of the invention, with the co-
factors required for activity, and with a test compound. The amount of the enzyme
activity in the sample is then determined by measuring the amount of an optical
property, such as a fluorescent property, at least a first and second time after contact
between the sample, the chimeric protein substrate of the invention, any co-
factors or components required to conduct the reaction, and the test compound. Then
the amount of activity per mole of enzyme, for example, in the presence of the test
compound is compared with the activity per mole of enzyme in the absence of the test
compound. A difference indicates that the test compound alters the activity of the
enzyme. In general, a change in the optical parameter by any measurable amount
between activity in the presence of the test compound as compared with the activity in
the absence of the test compound is indicative of activity.
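
The two-time-point comparison described above can be sketched in Python as follows. The sketch assumes that the change in the optical property between the first and second reading is proportional to enzyme activity, which is a simplification; the function names, the optional noise tolerance and every number shown are hypothetical.

```python
# Sketch only: activity is approximated as the change in fluorescence per unit
# time, normalized to the amount of enzyme. All values are illustrative.
def activity_per_mole(f_first, f_second, t_first, t_second, moles_enzyme):
    """Rate of change of the optical property per mole of enzyme."""
    if t_second <= t_first:
        raise ValueError("second reading must be taken after the first")
    return (f_second - f_first) / (t_second - t_first) / moles_enzyme


def compound_alters_activity(with_compound, without_compound, tolerance=0.0):
    """True if the two normalized activities differ by more than the tolerance."""
    return abs(with_compound - without_compound) > tolerance


if __name__ == "__main__":
    control = activity_per_mole(100.0, 160.0, 0.0, 60.0, 1e-9)
    treated = activity_per_mole(100.0, 130.0, 0.0, 60.0, 1e-9)
    print(control, treated, compound_alters_activity(treated, control))
```

In practice a tolerance reflecting measurement noise would be chosen from replicate controls; the default of zero simply mirrors the "any measurable amount" wording.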

In another embodiment, the ability of a compound to alter the activity of a
particular protein (i.e., a sensor polypeptide) in vivo is determined. In an in vivo
assay, cells transfected with an expression vector encoding a substrate of the invention
are exposed to different amounts of the test compound, and the effect on the optical
parameter, such as fluorescence, in each cell can be determined. Typically, the
difference is calibrated against standard measurements to yield an absolute amount of
protein activity. This provides a method for screening for compounds which affect
cellular events (e.g., receptor-ligand binding, protein-protein interactions or protein
kinase activation). In a given cell type, any measurable change between activity in
the presence of the test compound as compared with the activity in the absence of the
test compound is indicative of activity.

The materials of the invention are ideally suited for a kit for determining the
presence of an activity in a sample. Such a kit may contain a container containing a
chimeric protein comprising an optically active polypeptide having operatively ...

... micrometers in diameter which allows reproducible injection of 1-2 pl of DNA
solution. The use of zygotes as a target for gene transfer has a major advantage in that
in most cases the injected DNA will be incorporated into the host gene before the first
cleavage (Brinster et al., Proc. Natl. Acad. Sci. USA 82:4438-4442, 1985). As a
consequence, all cells of the transgenic non-human animal will carry the incorporated
transgene. This will in general also be reflected in the efficient transmission of the
transgene to offspring of the founder since 50% of the germ cells will harbor the
transgene. Microinjection of zygotes is the preferred method for incorporating
transgenes in practicing the invention.



The term "transgenic" is used to describe an animal which includes exogenous
genetic material within all of its cells. A "transgenic" animal can be produced by
cross-breeding two chimeric animals which include exogenous genetic material
within cells used in reproduction. Twenty-five percent of the resulting offspring will
be transgenic, i.e., animals which include the exogenous genetic material within all of
their cells in both alleles. Fifty percent of the resulting animals will include the
exogenous genetic material within one allele and 25% will include no exogenous
genetic material.
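
The 25%/50%/25% split stated above is the ordinary Mendelian outcome when each parent passes the transgene to half of its gametes. The Python sketch below enumerates the four equally likely gamete pairings; the allele labels ("T" for transgene, "-" for none) and the function name are notation chosen for this sketch only.

```python
# Sketch only: enumerates offspring genotypes for a cross in which each parent
# contributes either the transgene ("T") or no transgene ("-") with equal odds.
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent_a=("T", "-"), parent_b=("T", "-")):
    """Return {number of transgene-bearing alleles: fraction of offspring}."""
    counts = Counter(
        sum(allele == "T" for allele in pairing)
        for pairing in product(parent_a, parent_b)
    )
    total = sum(counts.values())
    return {copies: Fraction(n, total) for copies, n in counts.items()}


if __name__ == "__main__":
    for copies, share in sorted(cross().items(), reverse=True):
        print(f"{copies} allele(s) carrying the transgene: {share}")
```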

Viral infection can also be used to introduce a transgene into a non-human
animal (e.g., retroviral, adenoviral or any other RNA or DNA viral vectors). The
developing non-human embryo can be cultured in vitro to the blastocyst stage. During
this time, the blastomeres can be targets for retroviral infection (Jaenisch, R., Proc.
Natl. Acad. Sci. USA 73:1260-1264, 1976). Efficient infection of the blastomeres is
obtained by enzymatic treatment to remove the zona pellucida (Hogan, et al. (1986) in
Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, N.Y.). The viral vector system used to introduce the transgene is typically a
replication-defective retrovirus carrying the transgene (Jahner, et al., Proc. Natl.
Acad. Sci. USA 82:6927-6931, 1985; Van der Putten, et al., Proc. Natl. Acad. Sci.
USA 82:6148-6152, 1985). Transfection is easily and efficiently obtained by
culturing the blastomeres on a monolayer of virus-producing cells (Van der Putten,
supra; Stewart, et al., EMBO J. 6:383-388, 1987). Alternatively, infection can be
performed at a later stage. Virus or virus-producing cells can be injected into the ...

... may be expressed in a transgenic non-human animal. The term "transgenic" as used
herein additionally includes any organism whose genome has been altered by in vitro
manipulation of the early embryo or fertilized egg or by any transgenic technology to
induce a specific gene knockout. The term "gene knockout" as used herein, refers to
the targeted disruption of a gene in vivo with complete loss of function that has been
achieved by any transgenic technology familiar to those in the art. In one
embodiment, transgenic animals having gene knockouts are those in which the target
gene has been rendered nonfunctional by an insertion targeted to the gene to be
rendered non-functional by homologous recombination. As used herein, the term
"transgenic" includes any transgenic technology familiar to those in the art which can
produce an organism carrying an introduced transgene or one in which an endogenous
gene has been rendered non-functional or "knocked out."

Methods For Identifying Insertion Sites

A number of methods for identifying insertion sites in fluorescent proteins,
such as GFP, YFP and CFP, are known in the art, including, for example, site directed
mutagenesis, insertional mutagenesis, and deletional mutagenesis. Other methods,
including that exemplified below and in the Examples, are known or easily
ascertained by one skilled in the art (see, for example, Abedi et al., supra).

Sites in, for example, GFP mutants which can tolerate insertions of sensor
polypeptides can be identified by generating mutant proteins by manipulating the
DNA sequence such that a variety of different insertions are produced and screening
the mutants by flow cytometry for mutants which retain fluorescence. Such insertions
may include replacement of certain amino acids, as well as the addition of a new
sequence without a corresponding deletion or replacement in the sequence of the
fluorescent protein. Variants identified in this fashion reveal sites which can tolerate
insertions while retaining fluorescence.
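
The two kinds of insertion mentioned above (addition of a new sequence without deletion, and replacement of a residue) can be enumerated in silico before any library is built. The Python sketch below lists one variant per position of a host sequence. The function name and the toy sequences are assumptions for illustration, and the sketch models only the sequence bookkeeping, not the flow cytometry screen that determines which variants remain fluorescent.

```python
# Sketch only: enumerates candidate insertion variants of a host sequence.
# Positions are reported 1-based, as residues are usually numbered.
def insertion_variants(host: str, insert: str, replace: bool = False):
    """Yield (position, variant) pairs for every residue of the host.

    replace=False: the insert is added directly after the residue.
    replace=True:  the residue itself is swapped for the insert.
    """
    for i in range(len(host)):
        if replace:
            yield i + 1, host[:i] + insert + host[i + 1:]
        else:
            yield i + 1, host[:i + 1] + insert + host[i + 1:]


if __name__ == "__main__":
    toy_host = "MSKGE"      # placeholder, not a real fluorescent protein
    toy_insert = "GGTGEL"   # hexapeptide linker named in this document
    for position, variant in insertion_variants(toy_host, toy_insert, replace=True):
        print(position, variant)
```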

Additionally, circular permutation techniques are also useful in identifying
sites in fluorescent proteins which are capable of tolerating insertions while retaining
the ability to fluoresce. Such techniques are exemplified herein as well as known
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EXAMPLES 

Cells and Protein Purification: 

Bacteria used in this study were BL21(DE3) Gold cells from Stratagene.
Transformation was performed by electroporating cells suspended in 10% glycerol
directly with a ligation mixture (0.1 cm cuvette, 12.5 kV/cm, 200 Ω, 25 µFd). For
protein expression, cells were grown in LB containing 100 mg/L ampicillin to an
OD600 of 0.6, at which time they were induced with 1 mM isopropylthiogalactoside.
Bacteria were allowed to express recombinant protein for 6 hours at room temperature
and then overnight at 4 °C. The bacteria were then pelleted by centrifugation,
resuspended in 50 mM Tris, 300 mM NaCl and lysed with a French Press. The
bacterial lysates were centrifuged at 30,000 x g for 30 minutes, and the supernatants
were incubated with NiNTA resin (from Qiagen, used for purifying circularly
permuted GFP and Calmodulin insertions) or Cobalt Talon Resin (from Clontech,
used for purifying zinc-finger inserts).
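
The electroporation settings quoted above imply an applied voltage and a nominal pulse time constant that follow from simple arithmetic. The Python sketch below works them out. It assumes that the quoted field strength applies across the full 0.1 cm cuvette gap and that the 200 Ω setting dominates the discharge (so that the time constant is approximately R times C); both are assumptions of this sketch rather than statements from the protocol.

```python
# Sketch only: derived quantities for the stated electroporation settings.
def applied_voltage_kv(field_kv_per_cm: float, gap_cm: float) -> float:
    """Voltage across the cuvette gap, in kilovolts."""
    return field_kv_per_cm * gap_cm


def nominal_time_constant_ms(resistance_ohm: float, capacitance_farad: float) -> float:
    """RC time constant of the discharge, in milliseconds."""
    return resistance_ohm * capacitance_farad * 1e3


if __name__ == "__main__":
    print(applied_voltage_kv(12.5, 0.1))          # kilovolts
    print(nominal_time_constant_ms(200, 25e-6))   # milliseconds
```

Under these assumptions the settings correspond to about 1.25 kV and a time constant of about 5 ms.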

Cloning and Gene Construction:

Yellow GFP mutants (YFPs) with peptide insertions replacing Y145 were
made by performing two separate polymerase chain reactions (PCRs). The first PCR
amplified the N terminal piece of YFP to include a 5' BamHI site and 3' replacement
of Y145 with the hexapeptide linker GGTGEL (coded for by DNA containing KpnI
and SacI restriction sites for subsequent cloning). The second PCR amplified the C
terminal piece of YFP to include the 5' linker (GGTGEL) replacing Y145 and a 3'
EcoRI site. These two PCR products were combined and amplified with N and C
terminal YFP primers to yield a full length cDNA containing the insertion. The full
length cDNA was restricted with BamHI and EcoRI, ligated and cloned into the
BamHI and EcoRI sites of pRSET B (Invitrogen) to yield the plasmid pYFPins.
Next, the cDNAs for Xenopus Calmodulin and the first zinc-finger motif from zif268
were amplified with PCR using primers containing 5' KpnI sites and 3' SacI sites and
digested with KpnI and SacI. Finally, insertions into YFPs were made by cloning
cDNAs of inserted proteins in between the KpnI and SacI sites of pYFPins (Figure
1).

... absorbing band represents the protonated chromophore and the fluorescent, blue-
absorbing band represents the deprotonated chromophore, these data show that
calcium binding alters the equilibrium between protonated and deprotonated
chromophore states, i.e. changes the pKa of fluorescence.
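
The link between a shift in chromophore pKa and a change in fluorescence at fixed pH can be made concrete with the Henderson-Hasselbalch relation, under which the deprotonated fraction is 1/(1 + 10^(pKa - pH)). The Python sketch below evaluates that fraction. It assumes a single titratable group and fluorescence proportional to the deprotonated fraction; the pKa and pH values are made up for illustration and are not measurements from this document.

```python
# Sketch only: single-site Henderson-Hasselbalch model of the chromophore.
def fraction_deprotonated(ph: float, pka: float) -> float:
    """Fraction of chromophore in the deprotonated (fluorescent) state."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


if __name__ == "__main__":
    ph = 7.5                       # hypothetical assay pH
    for label, pka in (("higher pKa", 10.0), ("lower pKa", 7.0)):  # hypothetical
        print(label, round(fraction_deprotonated(ph, pka), 4))
```

In this simple model, a ligand-induced drop in pKa at constant pH raises the deprotonated fraction and therefore the fluorescence, which is the direction of change described above.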

YFPs containing a zinc-finger motif derived from zif268 also increase in
fluorescence on binding zinc. The change in fluorescence for Zn-sensing YFPs is
substantially less than that of Calmodulin-YFPs described above. First, as Zn finger
motifs contain cysteine residues in close proximity, they can be prone to oxidation,
which would prevent zinc binding. Second, the change of inserted proteins on
substrate binding is responsible for the change in pKa and therefore the change in
fluorescence. It could be possible that zinc-finger motifs adopt a non-optimal
conformation on binding zinc, leaving the chromophore still vulnerable to
protonation. Last, each of the three reasons listed above concerning the performance
of YFP-Calmodulin insertions could in principle also apply analogously to the zinc
finger insertions. As stated above, the zinc-finger inserted YFP reported here is just a
first generation prototype of what will likely become a powerful new class of
indicators.

Circular Permutations:

Circular permutations of GFP mutants with a new N terminus at Y145 were
made by performing two separate PCRs. The first PCR amplified the C terminal
piece of a GFP mutant (N terminal in the final permuted gene) to have a 5' BamHI
site, the mutation Y145M, and a 3' hexapeptide linker (GGTGGS) containing a KpnI
site. The second PCR amplified the N terminal piece of the GFP mutant (C terminal
in the final permuted gene) to have a 5' hexapeptide linker (GGTGGS) containing a
KpnI site, and a 3' EcoRI site. The first PCR product was digested sequentially
with BamHI and KpnI, the second PCR product was digested sequentially with
EcoRI and KpnI, and the fragments were purified by agarose gel electrophoresis.
The N and C terminal PCR fragments were then cloned in a three-part ligation into the
BamHI/EcoRI sites of pRSET B (See Figure 5).
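
The rearrangement described above can be sketched at the protein level as a string operation: the C terminal piece (starting at the new N terminus) comes first, followed by the linker and then the original N terminal piece. The function name `circularly_permute` and the toy sequence are hypothetical and used only for illustration; this is a minimal sketch, not the cloning procedure itself.

```python
def circularly_permute(protein: str, new_start: int, linker: str,
                       start_with_met: bool = True) -> str:
    """Return a circularly permuted sequence.

    protein        -- original sequence, one-letter codes
    new_start      -- 1-based position that becomes the new N terminus
    linker         -- peptide joining the original C and N termini
    start_with_met -- replace the new first residue with Met, as in the
                      Y145M mutation described in the text
    """
    if not 1 < new_start <= len(protein):
        raise ValueError("new_start must lie inside the sequence (2..len)")
    c_piece = protein[new_start - 1:]   # becomes the new N terminal piece
    n_piece = protein[:new_start - 1]   # becomes the new C terminal piece
    if start_with_met:
        c_piece = "M" + c_piece[1:]
    return c_piece + linker + n_piece


# Toy 10-residue sequence (hypothetical), new N terminus at position 4.
toy = "ABCDEFGHIJ"
permuted = circularly_permute(toy, new_start=4, linker="GGTGGS")
print(permuted)
```

For a real GFP sequence, `new_start=145` with the linker `GGTGGS` would correspond to the construct described in this section.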

three downstream stop codons in each reading frame between the BamHI and EcoRI
sites of pRSET B. This vector (pRSET triple stop) was digested with EcoRV,
treated with Alkaline Phosphatase, and purified by agarose gel electrophoresis.

[...] of circular permutations, the circularly permuted GFP gene
described above was amplified by PCR with primers that created [...]
ending at an XhoI site (ctcgag) [...].
The PCR product was digested with XhoI and cloned into the XhoI site of
pBluescript. This plasmid was amplified in bacteria, purified with a Qiagen [...]
prep, digested with XhoI, and the [...] bp fragment was agarose gel-purified to yield
a linearized GFP gene. The linear fragment was circularized at a concentration of [...]
with 8000 U/mL [...] (New England Biolabs) overnight at 16°C.
After ethanol precipitation, the DNA was digested with DNAse (100 U/[...] DNA,
Pharmacia) for 15 minutes at room temperature in 50 mM Tris-HCl, pH 7.5 and 1 mM
MnCl2. Digestion was stopped by phenol extraction, then subsequent
phenol/chloroform/isoamyl alcohol and chloroform/isoamyl alcohol extractions.
The DNA was ethanol precipitated, resuspended in 1x synthesis buffer (Stratagene)
and incubated with T7 DNA polymerase (Stratagene) and T4 DNA ligase (Stratagene)
at room temperature for 1 hour to repair DNA nicks and fill sticky ends (Figure 6).

The linear, repaired, randomly permuted DNA library was purified by agarose 
gel electrophoresis, ligated blunt into the pRSET triple stop expression vector, and 
electro-transformed into BL21 bacteria. 
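
The idea behind the random library (circularize the gene, then open the circle at a random position) can be sketched as follows. This is a simplified model under stated assumptions: each clone receives exactly one cut, no bases are lost or added during repair, and reading frame and insert orientation are ignored. The function name and the toy sequence are hypothetical.

```python
import random
from typing import List


def random_circular_permutations(gene: str, n_clones: int, seed: int = 0) -> List[str]:
    """Simulate opening a circularized gene once at a random position.

    Every returned sequence is a rotation of the input, which models a
    single random cut in the circular molecule.
    """
    rng = random.Random(seed)
    library = []
    for _ in range(n_clones):
        cut = rng.randrange(len(gene))          # position of the random cut
        library.append(gene[cut:] + gene[:cut])  # relinearized molecule
    return library


# Toy sequence (hypothetical), standing in for the linker-joined GFP gene.
TOY_GENE = "ATGGCTAGCAAAGGTGGTACCGGTGGCTCC"
library = random_circular_permutations(TOY_GENE, n_clones=5, seed=1)
for clone in library:
    print(clone)
```

Only a fraction of such rotations would be in frame and fluorescent, which is consistent with the need for the colony screen described next.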

LB/agar plates containing ampicillin usually displayed a few thousand
colonies per plate and were screened by digital imaging of fluorescence. The plates
were illuminated with a 150 W xenon arc lamp through a 450-490 nm bandpass filter
and a pair of fiber optic light guides (Oriel Instruments) positioned to illuminate the
top surface of the agar as evenly as possible. The emitted fluorescence was selected
by a 510-550 nm bandpass filter and focused by an f/1.2 camera lens (Nikon) onto a
cooled charge-coupled-device camera (Photometrics). Digital images from the camera
were analyzed with Metafluor software (Universal Imaging Corp.). Out of
approximately 25,000 bacterial colonies screened, about 200 became fluorescent after
24 hours at 4°C, and 144 of these were picked for plasmid minipreps and restriction
analysis. All plasmid minipreps were digested with HindIII and KpnI to analyze the
site of permuted termini (HindIII cuts 3' to the GFP gene in pRSETB, KpnI cuts at
the linker between N and C termini of GFP). Clones which gave restriction fragments
of ~750 bp or no visible fragments from 100-1000 bp were considered to be
regenerations of nearly wild-type sequence and were not investigated further. Clones
which gave restriction fragments between 100 and 1000 bp, but not 750 bp, were
sequenced at their N and C termini to pinpoint the exact locations of new termini
within the GFP sequence.
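
The triage rule for the HindIII/KpnI digests can be written out as a small decision function. The sketch below is illustrative only: the function name, the return labels, and the 25 bp tolerance used to interpret "~750 bp" are assumptions, not values stated in the text.

```python
from typing import Iterable


def classify_clone(fragment_sizes_bp: Iterable[float], tolerance_bp: float = 25.0) -> str:
    """Apply the restriction-analysis rule described above.

    Only fragments in the 100-1000 bp window are considered. A clone with
    no fragment in that window, or with a fragment near 750 bp, is treated
    as a regeneration of nearly wild-type sequence; any other clone is
    flagged for sequencing of its termini.
    """
    in_window = [s for s in fragment_sizes_bp if 100 <= s <= 1000]
    if not in_window:
        return "near wild-type"
    if any(abs(s - 750) <= tolerance_bp for s in in_window):
        return "near wild-type"
    return "sequence termini"


# Hypothetical gel readouts (fragment sizes in bp).
print(classify_clone([750, 3100]))   # near wild-type
print(classify_clone([420, 3400]))   # sequence termini
print(classify_clone([3800]))        # near wild-type (nothing in window)
```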

GFP forms a fluorescent circularly permuted protein when its native N and C
termini are connected with the hexapeptide linker GGTGGS and new N termini are
formed at E142, Y143, Y145, H148, D155, H169, E172, D173, A227, or I229 (See
[...]). The permuted protein with the [...] Y145 was made and studied
for the Cyan, Green, and Yellow mutants of GFP (cpCFP, cpGFP, cpYFP). In each
case the protein had a pKa of fluorescence [...]. This [...] that interrupting GFP
and its mutants at Y145 generally increases the chromophore's sensitivity to
protonation, which is in agreement with the results obtained from the [...]
described above.

Table 22. Sequence Summary of Random Circular Permutations

Starting Amino Acid    Ending Amino Acid
E142M                  N144LSE
Y143N                  N146LSE
Y143I                  N144LSE
Y145M                  N144
H148I                  N149LSE
H148I                  K162SE
D155I                  K156SE
H169H                  N170LSE
H169I                  N170LSE
E172M                  I171[...]SE
D173I                  D173LSE
D173D                  E172SE
A227A                  A227I
I229I                  I229I


[...] E142M means that GFP starts at position E142 but the [...] has been changed
to methionine by the cloning process.

[...] N144LSE [...] position 144, but is appended with [...]. [...] simply ends
with asparagine at position 144 [...].

To address cpCFP's orientation in space in fusion proteins, we constructed a
cameleon molecule with cpCFP replacing CFP (YC3.2). With this modification the
FRET difference between calcium-free and calcium-bound states of cameleon
changes dramatically compared to normal cameleons. This demonstrates that cpCFP
maintains a different orientation in space than CFP, since other factors influencing
FRET (inter-fluorophore distance and spectral overlap) have changed only very little.
These data further suggest that the above list of possible cpGFP mutants represents a
library of GFP mutants for use in FRET applications. Since most should have
different orientations in space than regular GFP mutants, every FRET-based
application of a GFP mutant that fails due to poor orientation could possibly be
improved through use of cpGFP mutants (Figure 7).
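
To make the role of orientation concrete, the sketch below uses the standard Förster relations: transfer efficiency E = 1 / (1 + (r/R0)^6), with R0^6 proportional to the orientation factor kappa^2 when spectral overlap and other terms are held fixed. The reference Förster radius of 5 nm and the 5 nm distance are hypothetical numbers chosen for illustration, not values from this work.

```python
def forster_radius(r0_ref_nm: float, kappa2: float, kappa2_ref: float = 2.0 / 3.0) -> float:
    """Scale a reference Forster radius to a different orientation factor.

    R0**6 is proportional to kappa**2, so R0 scales with (kappa2/kappa2_ref)**(1/6).
    kappa**2 ranges from 0 to 4; 2/3 is the usual dynamic-averaging value.
    """
    if not 0.0 <= kappa2 <= 4.0:
        raise ValueError("kappa2 must be between 0 and 4")
    return r0_ref_nm * (kappa2 / kappa2_ref) ** (1.0 / 6.0)


def fret_efficiency(distance_nm: float, r0_nm: float) -> float:
    """FRET efficiency for a donor-acceptor pair at a given distance."""
    if r0_nm == 0.0:
        return 0.0  # perpendicular dipoles: no transfer
    return 1.0 / (1.0 + (distance_nm / r0_nm) ** 6)


# Hypothetical numbers: same distance, different relative orientations.
R0_REF = 5.0    # nm, assumed Forster radius at kappa^2 = 2/3
DISTANCE = 5.0  # nm, assumed inter-fluorophore distance

e_averaged = fret_efficiency(DISTANCE, forster_radius(R0_REF, 2.0 / 3.0))
e_aligned = fret_efficiency(DISTANCE, forster_radius(R0_REF, 4.0))
print(f"kappa^2 = 2/3: E = {e_averaged:.3f}")
print(f"kappa^2 = 4:   E = {e_aligned:.3f}")
```

In this model a change in orientation alone moves the efficiency substantially at constant distance, which is the kind of effect the cpCFP cameleon result points to.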

One obvious use of cpGFP mutants relies purely on their increased pKas.
Subcellular pH can be measured with appropriately targeted non-permuted GFP
mutants; however, the dynamic range of pHs over which non-permuted GFPs can
measure is limited by the pKas of available mutants. Since all circular permutants
thus far investigated have higher pKas than their non-permuted counterparts, they
theoretically extend the ability of GFPs to sense pH in more alkaline compartments,
and could allow one to investigate relatively neutral compartments with blue-shifted
mutants, which was previously not possible.
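
The link between an indicator's pKa and the pH range it can report can be sketched by inverting a single-site titration curve. The model below is an assumption (one titratable site, Hill coefficient of 1), and the plus-or-minus one pH unit window is a common rule of thumb rather than a figure from the text; the function names are hypothetical.

```python
import math
from typing import Tuple


def ph_from_fluorescence(f: float, f_min: float, f_max: float, pka: float) -> float:
    """Estimate pH from a fluorescence reading under a single-site model.

    f_min and f_max are the fluorescence of the fully protonated and fully
    deprotonated indicator, respectively.
    """
    fraction = (f - f_min) / (f_max - f_min)
    if not 0.0 < fraction < 1.0:
        raise ValueError("reading is outside the titratable range")
    return pka + math.log10(fraction / (1.0 - fraction))


def usable_ph_window(pka: float, half_width: float = 1.0) -> Tuple[float, float]:
    """Approximate pH range over which the indicator responds usefully."""
    return (pka - half_width, pka + half_width)


# Hypothetical indicator with pKa 8.0 read at half-maximal fluorescence.
print(ph_from_fluorescence(0.5, 0.0, 1.0, pka=8.0))
print(usable_ph_window(8.0))
```

Raising the pKa shifts this window toward more alkaline values, which is why higher-pKa permutants extend the measurable range.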






Another possible use of cpGFP mutants is in making novel insertions of GFP
into other proteins for use as biosensors. GFP, because its termini are close in space,
can be inserted into other proteins, but only rarely to date has it been shown to sense a
conformational change in such a construct. When cpGFP mutants are inserted into a
protein, they are topologically similar to the GFP insertion constructs described
above, and they might reasonably be expected to have similar sensing properties as
GFP insertions (Figure 7).

GFP mutants with peptide insertions replacing Y145 were made by
performing two separate polymerase chain reactions (PCRs). The first PCR amplified
the N-terminal piece of GFP to include a 5' BamHI site and 3' replacement of Y145
with the hexapeptide linker GGTGEL (coded for by DNA containing KpnI and SacI
restriction sites for subsequent cloning). The second PCR amplified the C-terminal
piece of GFP to include the 5' linker (GGTGEL) replacing Y145 and a 3' EcoRI site.
These two PCR products were combined, amplified with N- and C-terminal GFP
primers to yield a full length fragment, restricted with BamHI and EcoRI, ligated and
cloned into the BamHI and EcoRI sites of pRSETB (Invitrogen). Subsequent

1. An isolated nucleic acid sequence which encodes a fluorescent
indicator, the indicator comprising:

a sensor polypeptide which is responsive to a chemical, biological, electrical 
or physiological parameter; and 

a fluorescent protein moiety, 

wherein the sensor polypeptide is operatively inserted into the fluorescent 
protein moiety, and wherein the fluorescence of the fluorescent protein moiety is 
affected by the responsiveness of the sensor polypeptide. 

2. The nucleic acid of claim 1, wherein the fluorescent protein moiety is
an Aequorea-related fluorescent protein moiety.

3. The nucleic acid of claim 2, wherein the Aequorea-related protein is
GFP, CFP or YFP.

4. The nucleic acid of claim 1, wherein the sensor polypeptide comprises 
two or more polypeptides operatively linked to each other. 

5. The nucleic acid of claim 1, wherein the sensor polypeptide is selected
from the group consisting of calmodulin or binding fragment thereof, a
calmodulin-related protein, recoverin, a nucleoside diphosphate or triphosphate
binding protein, an inositol-1,4,5-triphosphate receptor, a cyclic nucleotide receptor,
a nitric oxide receptor, a growth factor receptor, a hormone receptor, a ligand-binding
domain of a hormone receptor, a steroid hormone receptor, a ligand binding domain of a
steroid hormone receptor, a cytokine receptor, a growth factor receptor, a
neurotransmitter receptor, a ligand-gated channel, a voltage-gated channel, a protein
kinase C, a domain of protein kinase C, a cGMP-dependent protein kinase, an inositol
polyphosphate receptor, a phosphate receptor, a carbohydrate receptor, an SH2
domain, an SH3 domain, a PTB domain, an antibody, an antigen-binding site from an
antibody, a single-chain antibody, a zinc-finger domain, a protein kinase substrate, a
protease substrate, a phosphorylation domain, a redox sensitive loop, a loop
containing at least two cysteines that can form a cyclic disulfide, and a fluorescent
protein moiety.



6. The nucleic acid of claim 1, wherein the sensor polypeptide is
calmodulin or a calmodulin-related protein moiety.

7. The nucleic acid of claim [...], wherein the sensor polypeptide is
selected from the group consisting of a calmodulin-binding domain of skMLCKp,
smMLCK, CaMKII, Caldesmon, Calspermin, phosphofructokinase calcineurin,
phosphorylase kinase, Ca2+-ATPase 59 kDa PDE, 60 kDa PDE, nitric oxide synthase,
type I adenylyl cyclase, Bordetella pertussis adenylyl cyclase, Neuromodulin,
Spectrin, MARCKS, F52, -Adducin, HSP90a, HIV-1 gp160, BBMHBI, Dilute MHC,
Mastoparan, Melittin, Glucagon, Secretin, VIP, GIP, and Model Peptide CBP2.

8. The nucleic acid of claim 1, further comprising a linker moiety
linking the N- and C-terminal ends of the sensor polypeptide to the fluorescent protein
moiety.

9. The nucleic acid of claim [...], wherein the indicator further comprises a
localization sequence.

10. The nucleic acid of claim 2, wherein the sensor polypeptide replaces
one or more amino acids between residues 128-148, residues 155-160, residues 168-
176 or residues 227-229 of the fluorescent protein moiety.



[...] replaces E142, Y143, Y145, H148, D155, H169, E172, D173, A227, or I229 of
the fluorescent protein moiety.


12. The nucleic acid of claim 2, wherein the sensor polypeptide replaces
residue Y145 of the fluorescent protein moiety.

13. The nucleic acid of claim 8, wherein the linker moiety replaces E142,
Y143, Y145, H148, D155, H169, E172, D173, A227, or I229 of the fluorescence
protein moiety.

14. The nucleic acid of claim 8, wherein the linker moiety replaces residue
Y145 of the fluorescence protein moiety.

15. An expression vector containing the nucleic acid sequence of claim 1.

16. A transgenic non-human animal comprising a nucleic acid sequence
according to claim 1.

17. An expression vector comprising expression control sequences 
operatively linked to a nucleic acid sequence coding for the expression of a 
fluorescent indicator, the indicator comprising: 

a sensor polypeptide which is responsive to a chemical, biological, electrical 
or physiological parameter; and 

a fluorescent protein moiety, 

wherein the sensor polypeptide is operatively inserted into the fluorescent 
protein moiety, and wherein the fluorescence of the fluorescent protein moiety is 
affected by the responsiveness of the sensor polypeptide. 

18. A host cell transfected with an expression vector comprising an
expression control sequence operatively linked to a sequence coding for the 
expression of a fluorescent indicator, the indicator comprising: 

a sensor polypeptide which is responsive to a chemical, biological, electrical 
or physiological parameter; and 

a fluorescent protein moiety, 

wherein the sensor polypeptide is operatively inserted into the fluorescent 
protein moiety, and wherein the fluorescence of the fluorescent protein moiety is 
affected by the responsiveness of the sensor polypeptide. 



19. The cell of claim 18, wherein the cell is a prokaryote.

20. The cell of claim 19, wherein the cell is E. coli.

21. The cell of claim 18, wherein the cell is a eukaryotic cell.

22. The cell of claim 21, wherein the cell is a yeast cell.

23. The cell of claim 21, wherein the cell is a mammalian cell.



24. [...] indicator comprising:

a sensor polypeptide which is responsive to a chemical, biological, electrical
or physiological parameter; and

a fluorescent protein moiety,

wherein the sensor polypeptide is operatively inserted into the fluorescent
protein moiety, and wherein the fluorescence of the fluorescent protein moiety is
affected by the responsiveness of the sensor polypeptide.

25. The polypeptide of claim 24, wherein the [...]

26. [...] GFP [...]

27. The polypeptide of claim 24, wherein the sensor polypeptide comprises
two or more polypeptides operatively linked to each other.

28. The polypeptide of claim 24, wherein the sensor polypeptide is
selected from the group consisting of calmodulin or binding fragment thereof, a
calmodulin-related protein, recoverin, a nucleoside diphosphate or triphosphate
binding protein, an inositol-1,4,5-triphosphate receptor, a cyclic nucleotide receptor, a
nitric oxide receptor, a growth factor receptor, a hormone receptor, a ligand-binding
domain of a hormone receptor, a steroid hormone receptor, a ligand binding domain
of a steroid hormone receptor, a cytokine receptor, a growth factor receptor, a
neurotransmitter receptor, a ligand-gated channel, a voltage-gated channel, a protein
kinase C, a domain of protein kinase C, a cGMP-dependent protein kinase, an inositol
polyphosphate receptor, a phosphate receptor, a carbohydrate receptor, an SH2
domain, an SH3 domain, a PTB domain, an antibody, an antigen-binding site from an
antibody, a single-chain antibody, a zinc-finger domain, a protein kinase substrate, a
protease substrate, a phosphorylation domain, a redox sensitive loop, a loop
containing at least two cysteines that can form a cyclic disulfide, and a fluorescent
protein moiety.

29. The polypeptide of claim 24, wherein the sensor polypeptide is
calmodulin or a calmodulin-related protein moiety.

30. The polypeptide of claim 24, wherein the sensor polypeptide is
selected from the group consisting of a calmodulin-binding domain of skMLCKp,
smMLCK, CaMKII, Caldesmon, Calspermin, phosphofructokinase calcineurin,
phosphorylase kinase, Ca2+-ATPase 59 kDa PDE, 60 kDa PDE, nitric oxide
synthase, type I adenylyl cyclase, Bordetella pertussis adenylyl cyclase,
Neuromodulin, Spectrin, MARCKS, F52, -Adducin, HSP90a, HIV-1 gp160,
BBMHBI, Dilute MHC, Mastoparan, Melittin, Glucagon, Secretin, VIP, GIP, or
Model Peptide CBP2.

31. The polypeptide of claim 24, further comprising a linker moiety
linking the N- and C-terminal ends of the sensor polypeptide to the fluorescent protein
moiety.

32. The polypeptide of claim 24, wherein the indicator further comprises a
localization sequence.

33. The polypeptide of claim 25, wherein the sensor polypeptide replaces
one or more amino acids between residues 128-148, residues 155-160, residues 168-
176 or residues 227-229 of the fluorescent protein moiety.

34. The polypeptide of claim [...], wherein the sensor polypeptide replaces
E142, Y143, Y145, H148, D155, H169, E172, D173, A227, or I229 of the
fluorescence protein moiety.

35. The polypeptide of claim 25, wherein the sensor polypeptide replaces
residue Y145 of the fluorescent protein moiety.

36. The polypeptide of claim 31, wherein the linker moiety replaces
residue E142, Y143, Y145, H148, D155, H169, E172, D173, A227, or I229 of the
fluorescent protein moiety.

37. The polypeptide of claim 31, wherein the linker moiety replaces residue
Y145 of the fluorescent protein moiety.

38. A fluorescent indicator, the indicator comprising:

a sensor polypeptide which is responsive to a chemical, biological, electrical
or physiological parameter; and

a fluorescent protein moiety,

wherein the sensor polypeptide is operatively inserted into the fluorescent
protein moiety, and wherein the fluorescence of the fluorescent protein moiety is
affected by the responsiveness of the sensor polypeptide resulting in protonation or
deprotonation of the chromophore of the fluorescent protein moiety.

39. The indicator of claim 38, wherein the fluorescent protein moiety is an
Aequorea-related fluorescent protein moiety.

40. The indicator of claim 39, wherein the Aequorea-related protein is
GFP, CFP or YFP.

41. The indicator of claim 39, wherein the sensor polypeptide replaces
residue Y145 of the fluorescence protein moiety.

42. A method for detecting the presence of a response-inducing member in
a sample, comprising contacting the sample with a fluorescent indicator, the indicator, 
comprising: 

a sensor polypeptide which is responsive to a chemical, biological, electrical 
or physiological parameter; and 

a fluorescent protein moiety, 

wherein the sensor polypeptide is operatively inserted into the fluorescent 
protein moiety, and wherein the fluorescence of the fluorescent protein moiety is 
affected by the responsiveness of the sensor polypeptide; and 

detecting a change in fluorescence wherein a change is indicative of the affect 
of the parameter on the sensor polypeptide 

43. The method of claim 42, wherein the affect is a change in electrical or 
chemical potential. 

44. The nucleic acid of claim 8, wherein the linker moiety is GGTGEL
(SEQ ID NO:1) or FKTRHN (SEQ ID NO:2).

45. The polypeptide of claim 31, wherein the linker moiety is GGTGEL
(SEQ ID NO:1) or FKTRHN (SEQ ID NO:2).

46. An isolated nucleic acid sequence encoding a circularly permuted 
fluorescent protein moiety comprising: 

a linker moiety linking the amino-terminal and carboxy-terminal amino acids 
of a fluorescent protein, wherein the amino and carboxy termini are linked as internal 
amino acids in the circularly permuted fluorescent protein moiety; and 

two terminal ends, wherein the first end is an amino-terminal end and the
second end is a carboxy terminal end and wherein the amino and carboxy terminal 
ends of the circularly permuted fluorescent protein moiety are different from the 
amino-terminal and carboxy-terminal amino acids of the fluorescent protein. 



47. The nucleic acid of claim 46, wherein the fluorescent protein is an
Aequorea-related fluorescent protein moiety.

48. The nucleic acid sequence of claim 47, wherein the Aequorea-related
protein moiety is GFP, CFP or YFP.

49. The nucleic acid of claim 46, wherein the linker moiety is
GGTGEL (SEQ ID NO:1), GGTGGS (SEQ ID NO:63) or FKTRHN (SEQ ID NO:2).

50. The nucleic acid of claim 47, wherein the [...]
[...] 168-176 or residues 227-229 of the fluorescent
protein.

51. The nucleic acid of claim 47, [...]
consisting of N144, N146, N144, N149, K[...]

53. The nucleic acid of claim [...]
selected from the group consisting of calmodulin [...]
[...] diphosphate or triphosphate [...]
[...] of protein kinase C, a cGMP-dependent protein kinase, an inositol
[...] domain, an SH3 domain, a PTB domain, an antibody, an antigen-binding site from an
antibody, a single-chain antibody, a zinc-finger domain, a protein kinase substrate, a 
protease substrate, a phosphorylation domain, a redox sensitive loop, a loop 
containing at least two cysteines that can form a cyclic disulfide, and a fluorescent 
protein moiety. 

55. The nucleic acid of claim 52, wherein the sensor polypeptide is
calmodulin or a calmodulin-related protein moiety.

56. The nucleic acid of claim 52, wherein the sensor polypeptide is
selected from the group consisting of a calmodulin-binding domain of skMLCKp,
smMLCK, CaMKII, Caldesmon, Calspermin, phosphofructokinase calcineurin,
phosphorylase kinase, Ca2+-ATPase 59 kDa PDE, 60 kDa PDE, nitric oxide synthase,
type I adenylyl cyclase, Bordetella pertussis adenylyl cyclase, Neuromodulin,
Spectrin, MARCKS, F52, -Adducin, HSP90a, HIV-1 gp160, BBMHBI, Dilute MHC,
Mastoparan, Melittin, Glucagon, Secretin, VIP, GIP, and Model Peptide CBP2.

57. The nucleic acid of claim 46, wherein the circularly permuted
fluorescent protein further comprises a localization sequence.

58. An expression vector containing the nucleic acid sequence of claim 46.

59. A transgenic non-human animal comprising a nucleic acid sequence 
according to claim 46. 

60. An expression vector comprising expression control sequences
operatively linked to a circularly permuted fluorescent protein moiety comprising:

a linker moiety linking the amino-terminal and carboxy-terminal amino acids
of a fluorescent protein, wherein the amino and carboxy termini are linked as internal
amino acids in the circularly permuted fluorescent protein moiety; and

two terminal ends, wherein the first end is an amino-terminal end and the
second end is a carboxy terminal end and wherein the amino and carboxy terminal
ends of the circularly permuted fluorescent protein moiety are different from the
amino-terminal and carboxy-terminal amino acids of the fluorescent protein.

WO 00/71565    PCT/US00/13684

61. A host cell transfected with an expression vector comprising an
expression control sequence operatively linked to a sequence coding for the
expression of a circularly permuted fluorescent protein moiety comprising:

a linker moiety linking the amino-terminal and carboxy-terminal amino acids
of a fluorescent protein, wherein the amino and carboxy termini are linked as internal
amino acids in the circularly permuted fluorescent protein moiety; and

two terminal ends, wherein the first end is an amino-terminal end and the
second end is a carboxy terminal end and wherein the amino and carboxy terminal
ends of the circularly permuted fluorescent protein moiety are different from the
amino-terminal and carboxy-terminal amino acids of the fluorescent protein.

62. The cell of claim 61, wherein the cell is a prokaryote.

63. The cell of claim 62, wherein the cell is E. coli.

64. The cell of claim 61, wherein the cell is a eukaryotic cell.

65. The cell of claim 65, wherein the cell is a yeast cell.

66. The cell of claim 65, wherein the cell is a mammalian cell.

67. An isolated polypeptide comprising a circularly permuted fluorescent
protein moiety comprising:

a linker moiety linking the amino-terminal and carboxy-terminal amino acids
of a fluorescent protein, wherein the amino and carboxy termini are linked as internal
amino acids in the circularly permuted fluorescent protein moiety; and

two terminal ends, wherein the first end is an amino-terminal end and the
second end is a carboxy terminal end and wherein the amino and carboxy terminal
ends of the circularly permuted fluorescent protein moiety are different from the
amino-terminal and carboxy-terminal amino acids of the fluorescent protein.

68. The polypeptide of claim 67, wherein the fluorescent protein is an
Aequorea-related fluorescent protein moiety.

69. The polypeptide of claim 68, wherein the Aequorea-related protein
moiety is a GFP, CFP or YFP.

70. The polypeptide of claim 67, wherein the linker moiety is GGTGEL
(SEQ ID NO:1), GGTGGS (SEQ ID NO:63) or FKTRHN (SEQ ID NO:2).

71. The polypeptide of claim 68, wherein the amino-terminal end or the
carboxy terminal end of the indicator has an amino acid of the fluorescent protein, the
amino acid corresponding to residues 128-148, residues 155-160, residues 168-176
or residues 227-229 of the fluorescent protein.

72. The polypeptide of claim 68, wherein the amino-terminal end is
selected from the group consisting of E142, Y143, Y145, H148, D155, H169, E172,
D173, A227 and I229, and the carboxy-terminal end is selected from the group
consisting of N144, N146, N144, N149, K162, K156, N170, I171, D173, E172, A227,
and I229, of the fluorescent protein, respectively.

73. The polypeptide of claim 67, further comprising a sensor polypeptide.

74. The polypeptide of claim 73, wherein the sensor polypeptide is
selected from the group consisting of calmodulin or binding fragment thereof, a
calmodulin-related protein, recoverin, a nucleoside diphosphate or triphosphate
binding protein, an inositol-1,4,5-triphosphate receptor, a cyclic nucleotide receptor, a
nitric oxide receptor, a growth factor receptor, a hormone receptor, a ligand-binding
domain of a hormone receptor, a steroid hormone receptor, a ligand binding domain
of a steroid hormone receptor, a cytokine receptor, a growth factor receptor, a
neurotransmitter receptor, a ligand-gated channel, a voltage-gated channel, a protein
kinase C, a domain of protein kinase C, a cGMP-dependent protein kinase, an inositol
polyphosphate receptor, a phosphate receptor, a carbohydrate receptor, an SH2
domain, an SH3 domain, a PTB domain, an antibody, an antigen-binding site from an
antibody, a single-chain antibody, a zinc-finger domain, a protein kinase substrate, a
protease substrate, a phosphorylation domain, a redox sensitive loop, a loop
containing at least two cysteines that can form a cyclic disulfide, and a fluorescent
protein moiety.

75. The polypeptide of claim [...], wherein the sensor polypeptide is
calmodulin or a calmodulin-related protein moiety.

76. The polypeptide of claim 73, wherein the sensor polypeptide is
selected from the group consisting of a calmodulin-binding domain of skMLCKp,
smMLCK, CaMKII, Caldesmon, Calspermin, phosphofructokinase calcineurin,
phosphorylase kinase, Ca2+-ATPase 59 kDa PDE, 60 kDa PDE, nitric oxide synthase,
type I adenylyl cyclase, Bordetella pertussis adenylyl cyclase, Neuromodulin,
Spectrin, MARCKS, F52, -Adducin, HSP90a, HIV-1 gp160, BBMHBI, Dilute MHC,
Mastoparan, Melittin, Glucagon, Secretin, VIP, GIP, and Model Peptide CBP2.

77. The polypeptide of claim 67, wherein the circularly permuted
fluorescent protein further comprises a localization sequence.

78. A circularly permuted fluorescent protein moiety comprising:

a linker moiety linking the amino-terminal and carboxy-terminal amino acids
of a fluorescent protein, wherein the amino and carboxy termini are linked as internal
amino acids in the circularly permuted fluorescent protein moiety; and

two terminal ends, wherein the first end is an amino-terminal end and the
second end is a carboxy terminal end and wherein the amino and carboxy terminal
ends of the circularly permuted fluorescent protein moiety are different from the
amino-terminal and carboxy-terminal amino acids of the fluorescent protein; and

a sensor polypeptide [...] electrical or
physiological parameter.

79. A method of producing a circularly permuted fluorescent nucleic acid
sequence, comprising:

linking a nucleic acid sequence encoding a linker moiety to the 5' nucleotide
of a polynucleotide encoding a fluorescent protein;

circularizing the polynucleotide with the nucleic acid sequence encoding the
linker sequence; and

cleaving the circularized polynucleotide with a nuclease, wherein cleavage
linearizes the circularized polynucleotide.

80. A method of producing a circularly permuted fluorescent protein
comprising expressing the nucleic acid sequence produced by the method of claim 79.

[Figure: YFP, Calmodulin Insertion, Sequence Summary. Segment labels: YFP Nter, Linker, Cam Nter, Cam Cter, Linker, YFP Cter.]

[Figure: YFP, Zif Insertion, Sequence Summary. Segment labels: YFP Nter, Linker, Zif Nter, Linker.]

[Figure: Zif Inserted Zinc Titration, pH 7.[...]. 2 mM Zn is 67% brighter than 50 µM EDTA. Horizontal axis: Wavelength (nm).]

SEQUENCE LISTING
<110> The Regents of the University of California
<120> FLUORESCENT PROTEIN INDICATORS



<130> REGENi4"7CWC
<140> 09/316,939
      09/315/ 020
<141> 1999-05-21
      1999-05-21
<160> 63
<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 6
<212> PRT
<213> Artificial Sequence

<220>
<223> Linker moiety

<400> 1
Gly Gly Thr Gly Glu Leu
  1               5

<210> 2
<211> 6
<212> PRT
<213> Artificial Sequence

<220>
<223> Linker moiety

<400> 2
Phe Lys Thr Arg His Asn
  1               5



<210> 3 

<211> 238 

<212> PRT 

<213> Aequorea victoria 



nlTlcl Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pre He Leu Val 
G \u Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser Cly Glu 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
Thr Thr civ Lvs Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
Ser Tyr Gly v 5 : dr. Cys Hhe Ser Arg Tyr Pro Asp His - , Ly s Gin 
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4 ^ s Ben Dh =u PCT/US00/13684 

His Asp Phe Phe Lvs Ser Ala Mez Pro Glu Gly Tyr va 1 Glr. " , Arc 

Thr lie Phe Tyr Lys Asp Asp Giy Asn Tyr Lys Thr Arc , Aio 1^ Va , 



Jis Thr LeU ™ Ar 9 He oiu Leu £ :Je 

Asp Phe Lys Clu A S p Gly Asn lit Le. Gly His Ly S Ne'e G_u Tyr Asn 
J-yr Asn Ser His Asn y.l Tyr lie Xec Ala Asp Jys Pro Lys. Asn Gly 
'He Lys Val Asn Phe Lys He Arc His Asn III Lys Asp Gly Ser J" 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Glv Pro 

185 : c C 



Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr G ' 
195 



20C 



n Ser 
2 05 



LVS Asp Pro Asn Gl u Lvs Ar g ks P His Me, Ile Leu ^ 3 _ u _ 



21S 22C 



Thr Ala Ala ^ly lie Thr His Gly Met Asp Glu L*u --r ■ — 
22S 230 235 " 

<210> 4
<211> 239
<212> PRT
<213> Aequorea victoria
<400> 4

Met Val ser Lys Gly clu Glu Leu Phe Thr Gly Val Val Pre * •. . v eu 

., , „, 5 10 - c 

^al Glu Leu Asp Gly Asp Val Asn Gly His ^ys Phe Ser Val sir Glv 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Tnr Leu Lvs ^ n e 

Cys Thr Thr Giy Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
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Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp K;s Me, -,s 
Gin His Asp Phe Phe Lys Ser Ala Mec Pre Glu Gly Tyr Val Glr, Gl,. 

8 5 go * * * 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tv- Lvs 

inn . - - ' n -"z n-ia Glu 



100 10S 



val Lys Phe Glu Gly Asp Thr Leu Va"! Asn Ary lie Giu Lej vs Gl- 
I.e Asp Phe Lys Glu Asp Gly Asn lie Leu Giy His Lys Leu Giu Tyr 



135 - L4 . 



Asn Tyr Asn Ser His Asn Val Tyr lie Mec Ala Asp 

150 155 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn He o u - 



145 " \Tn "" '" eL rtJa ASp L '- /s 3: " -:-'s Asn 

155 160 

16t - -sp -Jly ser 

17 0 

Val Gl. Leu A1 Asp His Tyr Qln Qln ^ 

ieo 185 



Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin III a" a Leu 
195 200 -jpc, 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His y.e-_ Val Leu Leu Phe 

Val Thr Ala Ala Gly lie Thr Leu Gly Mec Asp G^u L<=u -- r • 
" 5 230 235 

<210> 5
<211> 23?
<212> PRT
<213> Aequorea victoria
<400> 5



Met 


Val 


Ser 


Lys 


Gly 
s 


Glu 


Glu 


Leu 


Phe 


l 

Val 


Glu 


Leu 


Asp 


G 1 y 


Asp 


Val 


Asn 


Gly 








20 












Glu 


Gly 


Gi u 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 




35 










40 




Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Fro 


Val 


Pro 


50 
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Phe 


Gly 


Tyr 


G 1 y 


Val 


Gin 


Cys 


Phe 


Ala 


6 5 
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Gin 


His 


Asp 


Phe 


Phe 
85 


Lys 


Ser 


Ala 


Met 


y 


Thr 


He 


Phe 


Phe 


Lys 


Asp 


Asp 


Gly 






100 










105 


V a -L 




Phe 


Glu 


Gl v 


Asp 


Thr 


Leu 


Val 




115 










120 




He 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn 


He 




130 










135 






Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


14S 










150 








Gly 


He 


^ys 


val 


Asn 


Phe 


Lys 


lie 


Arg 








165 










Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 








180 










185 


Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


H i s 


Ty r 






195 










200 




Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 




210 










215 






Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 


Leu 


Gly 
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<210> 6 

<211> 239

<212> PRT

<213> Aequorea victoria

<400> 6
















Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Fhe 


1 

Val 


Glu 


Leu 


Asp 


5 

Gly 


Asp 


Val 


Asn 


Gly 








20 










25 


Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Giy 






35 










40 




Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Val 


Pre 


50 










55 






Phe 


Gly 


Tyr 


Gly 


Leu 


Lys 


Cys 


Phe 


Ala 
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7C 








Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 










35 










Arg 


Thr 


He 


Phe 


Phe 


Lys 


A.sp 


Asp 


Gly 






10C 










105 


Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 


val 




115 










120 
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Thr 


Gly 


Val 


Va 1 


Pro 


lie 


Leu 


10 










15 




H i s 


L*/3 


Phe 


Se r 


Va 1 


Ser 


Gl y 










30 






Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


He 








4 5 








Trp 


Pro 


Thr 


Leu 


Va 1 


Thr 


Thr 






60 










Arg 


Tyr 


Pre 


Asp 


His 


Met 


Lys 


75 










80 


Pro 


Glu 


Gly 


Tyr 


Va x 


Gin 


Glu 


90 










95 




Asn 


Tvr 


Lys 


Thr 


Arg 


Ala 


Glu 










110 






Asn 


Arg 


He 


Glu 
125 


Leu 


Lys 


Giy 


Leu 


Gly 


K*. S 


l.ys 


Leu 


Glu 


~yr 






14 0 










Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 




155 










160 


His 


Asn 


I : e 


G ' 1 1 


Asp 


G i y 


Ser 


170 










175 




Asn 


Thr 


Pro 


lie 


Gly 


Asp 


Gly 










190 






Leu 


Ser 


Tyr 


Glii 


Ser 


Ala 


Leu 








205 








His 


Met 


Val 


Leu 


Leu 


Glu 


Phe 






220 










Met 


Asp 


Glu 


Leu 


Tv r 


Lys 





235 



Thr Civ Val Val Pro lie Leu 
1C ' 1- 

His Lys Phe Ser Val Ser Gly 
30 

Lys Leu Thr Leu Lys Phe lie 
4 5 

Trp Pro Thr Leu Val Thr Thr 
60 

Arg Tvr Pro Asp His Met Lys 

75 80 
Pro Glu Gly Tyr Val Gin Glu 
90 95 
Asn Tvr Lvs Thr Arg Ala Glu 
1 1 C 

Asn Arg lie Glu Leu Lys Gly 
12 5 
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I^e A^n pho i ^ PCT/US00/J3684 

I-e Asp Phe Lys G-u Asp G 2v As, : i „ Le ,. Q , , 

130 g G "> H ~ s L ys leu Glut Tvr 

r " A " S " £ ^ T - - «• L y . C :„ Lvs .,.„ 

==X «• Ly, v., «sr. P „e ly , „. Arg H , s £ ^ ^ ^ ^ 

Val Gin Leu Ala Asn H — ^-r 1 ~. i?C :7 ~ 

Ala A.p H. s T>r Gin ^ As., Thr Pro lie Gly As p Gly 

Fro Val Leu Leu Pro Asp As:, His Tyr Ler c e . T -r c'° 

1SS -, co ' ~- e ' G - :l Ser Aia Leu 

Ser Lys Asp P ro Asn Glu Lys L s Asp K is Mec Val Jeu 

Val Thr Ala Ala Gly He ^ Leu Gly Met As? ^ 



210 Zi: ~ d ^ ^' 1S Val Leu Leu Glu Phe 

J - 223 

225 "I " u ,J1 >' iV!ec Asp Glu Leu Tvr Lvs 

iJU 235 

<210> 7 
<211> 239 
<212> PRT 

<213> Aequorea victoria



He C Val Ser Lys Gly c.u Glu Leu P, e Thr Gly Vai ,, a , prQ ^ ^ 
Val Glu Leu Asp c: : , asp V,: Asn Gly ^ Arg phe ^ ^ ^ ^ 
Clu Gly Glu oiy Asp Ala Thr Tyr ^ Lys Le , Tnr Leu ^ ^ ^ 
Cys Thr Thr G1 V Lys Leu Pro Val Pro Tr P Pre Tnr Leu V 3 : Thr Tkr 
Jju Thr Trp Giy val G , n e ys Phe ser Ar „ Tyr - o 

Gin His Asp Phe Phe Lys Ser Ala Mec Pro gL Gly Tvr Val Gln ^ 
- 3 Thr xie Phe Phe L ys Asp Asp „ L%. Tyr , ys Thr Arg ^ 
Val Lys Phe Glu Gl y Asp Thr ^ 0al As, Ar 3 He Glu Leu Lvs Gly 
He J.p Phe Lys Glu Asp c: y > m :ie Leu Gly His ^ ^ ^ 

Jjn Tyr lie Ser His Asn v.l Tyr :le Thr A. a Asp Lys Gln Lvs A .„ 
CIV He Lys Ala His Phe Lys Tie Ar g H is a" Ile Glu Asp Gly J er ° 



Val Cln > u Ala Asp His Tyr Gln Gln Asn Thr Pro He Gly Asp „v 
Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln H° r Ala Leu 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His „« val Leu Glu Phe 



185 

200 2Q5 
Arg Asp His Met Val 

Val Thr Ala Ala Gly lie ^ Leu Gly Wet Asp _ ^ ^ 

J U ?ic * 



<210> 8 
<211> 238 
<212> PRT 

<213> Aequorea victoria
<400> 8



Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
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Glu Leu Asp Glv As P Val Asn Giy His Lys Phe Ser Vai Ser Gly Giu 

20' 2S 3C 

Gly Giu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Pr.e lie Cys 

Th- Trr G'y Lvs Leu fro Val Pre Trp Pro Thr Leu Val Thr Thr Phe 

SC ' "55 60 

Giy Tyr Glv Leu Gir. Cys Phe Ala Arg Tyr Pro Asp His Met Lys Arg 

65 ™ ,S . 

His Asp Phe Phe Lvs Ser Ala Met Pro Glu Gly Tyr Val Gir. Giu Arg 

e 'c 90 9 = 

Thr lie Phe Phe Lvs Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Vai 

100 105 HO _ 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu _-evs I.ys C-.y ,.e 

il5 120 12S 

Asp Phe Lvs Giu Asp Giy Asn lie Leu Gly His Lys Leu Glu Tyr Asn 

130 13S I 40 

T»r A=n -==r Gly Asn Val Tyr lie Met Ala Asp Lys Gin. Lys msh o.y 
145 " " 150 155 16 < 

lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp G;y Ser Va. 

165 I 70 175 

Gh Leu Ala Asp His Tyr Gin Gin Asn Thr Pr; lie Gly Asp Giy Pre 

180 185 15G 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser A,a .eu _-er 

15b 2C0 205 

Lvs As p Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Pne Va, 

210 215 220 

Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 . 235 

<210> 9 
<211> 238 
<212> PRT 

<213> Aequorea victoria 

<400> 9

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1 5 10 15

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
20 25 30

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
35 40 45

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
50 55 60

Gly Tyr Gly Leu Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Arg
65 70 75 80

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
85 90 95

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
100 105 110

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
115 120 125

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
130 135 140

Tyr Asn Ser Gln Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145 150 155 160

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
165 170 175

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
180 185 190
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Val Leu Leu Pro Asp Asn 
1 95 

Lys Asp Pro Asn Glu ^ys 
210 

Thr Ala Ala Glv lie Th >- 

225 * 22C 

<210> 10 

<211> 239 

<212> PRT

<213> Aequorea victoria



His Tyr Leu Ser Tyr Gin 
20C 

Arg Asp His Me: Val Leu 

215 220 

His Gly Mec Asp Glu Leu 
225 
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Ser Ala Leu Ser 
2 0 5 

Leu Glu Phe Val 
Tyr Lys 



<400> 10 

Met Val Ser Lys Gly Glu Glu Leu Pne Thr 3ly Val Val Pro He Leu 

1 5 10 2 c 

Val Glu Leu Asp Gly Asp Val Asn Giy His Lvs Phe Ser Va ' Ser G 1 v 

20 2 5 3 - 

Glu Gly Glu Glv Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lvs Pr- lie 

3 $ 40 4 c 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pre Thr Leu Va "■ ^hr Thr 

50 55 60 

Phe Gly Tyr Gly Val Gin Cys Phe Ala Arc Tvr Pro Asp His L v* 

u_n His Asp Phe Phe Lys Ser Ala Met Pro Glu Glv Tvr v-^ G 1 Glu 

85 90 t' s " 

Arg Thr lie Phe Phe Lys Asp Asp Glv Asn Tvr Lvs Thr irg V< = Glu 

100 105 * nr f 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Ara lie Glu Leu Lv- Glv 

115 120 12 c 

He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lvs Leu Glu Tvr 
130 14Q 

Asn Tyr Asn Ser Gly Asn Val Tyr He Met Ala Asp Lvs Gin L-s Asn 
145 150 155 " ' i6c 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asc- Glv Ser 

165 170 * 1 - J ' C 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr *ro I 1 - Gi" 

iSO 185 19 o ' " ° 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tvr Gin Se- Leu 

195 200 ' 2CS 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Va i Leu Leu Glu Ph» 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Mec Asc Glu Leu -vr Lv= 
225 230 23*5 " 

<210> 11 
<211> 239
<212> PRT 

<213> Aequorea victoria 

<400> 11 
Met \ 
1 



50 

Phe Gi> 
65 



Ser 


Lys 


Gly 
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Glu 
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10 
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Val 
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1 5 


Leu 
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Asp 
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G 1 y 
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Val 






115 










120 


lie 


lie 


Asp 
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(j - U 


A Sp 
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130 










13 5 




He 


Asn 


Tyr 


Asn 


Ser 






Va 1 


Tvr 


14 5 
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lie 
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L v s 
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A I a 


Asp 
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Tyr 
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Gin 
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185 


Pro 


val 
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Pro 


Asp 


Asn 


His 


Tyr 




155 










200 




Ser 


Lys 
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Lys 


Arg 
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215 
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lie 
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Pro Glu Gly Tyr Val Gin Glu 

9 3 =" 

Asn Tvr Lvs Thr Arg Ala Glu 

i:o 

Asn Arg lie Glu Leu Lys Gly 
125 

Leu Gly His Lys Leu Glu Tyr 
140 

Met Ala Asc Lys Gin Lys Asn 
155 160 
His Asr. lie Glu Asp Gly Ser 
170 1 7S 
Asn Thr Fro He Gly Asp Giy 
190 

Leu Ser Tyr Gin Ser Ala Leu 
205 

His Met Val Leu Leu Glu Phe 
220 

Met Asp Glu Leu Tyr Lys 
23E 



<210> 12 
<211> 720
<212> DNA

<213> Aequorea victoria

<400> 12

ategegagea 

ggcgacgtaa 

ggcaagct ga 

cccgcgacca 

cagcacgact 

ttcaaggacg 

gtgaaccgea 

aagctggagt 

ggcat caagg 

gaccactacc 

tacctgagca 

ccgctggagc 



agggegayga 

acggccacaa 
ccctgaagt t 
ccccgacc t a 
t cttcaagtc 
aeggcaacta 
t cgagctgaa 
acaac tacaa 
tgaact ccaa 
agcagaacac 
cccagtccgc 
ccgcgaccgc 



cc tgttcacc 
ct ecagegtg 
ca tccgcacc 
cggcgtgcag 
cgccatgccc 
caagacccgc 
gggcat cgac 
cagccacaac 
gatccgccac 
ccccat egge 
cctgagcaaa 
cgccgggatc 



3999 tggtg- 
teeggegagg 
accggcaagc 
tget zcagcz 
gaaggctacg 
gccgaggtga 
t tcaaggagg 
g:ctatatca 
aaca tcgagg 
gacggccccg 
gaccccaacg 
actctcggca 



ccatcctggt 
gegagggega 
tgcccgtgcc 
get accccga 
z ecaggageg 
agt t cgaggg 
aeggcaaca t 
:ggccgacaa 
acggcagcct 
- getge tgee 
agaagegega 
tggacgagc t 



cgagctggac 
tgccacctac 
ctggcccacc 
rcacacgaag 
caccatcttc 
cgacaccccg 
cctggggcac 
gcagaagaac 
gcagctcgcc 
cgacaaccac 
c cacatggtc 
gt acaagtaa 



60 
120 
180 
240 
300 
360 
420 
480 
540
600 
660 
720 



<210> 13 

<211> 720 

<212> DNA 

<213> Aequorea victoria



<400> 13 

atggtgagca 

ggcgacgtaa 

ggcaagctga 

ctcgtgacca 

cagcacgact 

ttcaaggacg 

gtgaaccgea 

aagctggagt 

ggcatcaagg 

gaccactacc 

tacctgagct 

ctgctggagt 



agggegagga 
acggccacaa 
ccctgaagtt 
cct teggcta 
tct tcaagtc 
aeggcaacta 
tcgagctgaa 
acaac tacaa 
tgaac ttcaa 
agcagaacac 
accag teege 
ccgrgaccgc 



gctgttcacc 
gt teagegtg 
ca tctgcacc 
cggcgtgcag 
cgccatgccc 
caagacccgc 
gggcat cgac 
cagccacaac 
gatccgccac 
ccccaccggc 
cctgagcaaa 
cgccgggatc 



ggggtggtgc 
teeggegagg 
accggcaagc 
tget ccgccc 
gaaggctacg 
gccgaggtga 
t tcaaggagg 
ct ct atatca 
aaca tcgagg 
gacggccccg 
gaccccaacg 
actcicggca 



ccatcctggt 
gegagggega 
-gcccgtgcc 
gciaccccga 
t ecaggageg 
agt tcgaggg 
aeggcaacat 
iggccgacaa 
aeggcagegt 
:gctgctcc: 
agaagegega 
- ggacgagct 



cgagctggac 
cgccacct ac 
rtggcccac: 
ccacatgaag 
caccatct tc 
cgacaccc tg 
cc -ggggcac 
gcagaagaac 
gcaqcccgc: 
cgacaaccac 
ccacatggtc 
at acaagt aa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



1 



WO 00/71565

PCT/US00/13684

<210> 14
<211> 720

<212> DNA

<213> Aequorea victoria
<400> 14

;;s.5?. c . :sn:n: i"™ nr ct9ss 

33 caagcc g a ccc? 9 aag« Lcc^cacc ac^cla^ ^cc^gcc -^"cca" 
ctcgtgacca ccctgaccca cggcgtgcac t 3 -t-- aQ rr Lts9 '" CMCC 18C 

ccctc " 9t; "«^-« »«««^ 

ct.aaggacg acggcaacca caagacccgc ac-qacjaraa -a- r ~ ' ^^"-c 30C 

9cg.acc 9 c. ccgagc,gaa gggfatcgL ctcfll^ a^c^:^ ^ 9 ™ocI 9 
aagctggagc acaactacal cagc-acaac qc-- 3 r1-r, ^Z 9 -c-ggggcac 420 

™* 9 » C ^«SS "° 



cagccacaac gtccacatca ccgccgacaa =cagaagaac 
catccgccac aacatrqaqq acaa-aar-- ~r = 

«• 

«~— « ™«« 5255 sssk ss 



<210> 15 

<211> 720 

<212> DNA 

<213> Aequorea victoria

<400> 15 



<210> 16 
<211> 714
<212> DNA

<213> Aequorea victoria



<400> 16 

oafo?^ 9 S^gaagaact cttcaccgga gttgccccaa tccttgctca atcagatggc 

p"- £:.»-» zzzzz ™- 

:ns-» — r^s:: !«= 

aaagacgacg ggaaccacaa gacacgtgcc gaagccaagt ccgaaggtqa cac"t J 
a-tagaaccg actcaaaagg caccgattct aaagaagaca gaaacatc- -Sa-a"cla- 
lllltll^ *«"»«" ■S9«atgt. tacaccatgg Lc.ca.aci a m i 
cat^tc^ aCCCCaaaaC "gacacaac attgaaaacg gaagcgccra accagca|Ic 
" :"c"" CC t atC " C9aC 99«=c 3 ccc crttaccag. c.acc.tLc 
' BtCL9c ' c - "ceaaagar cccsacgaaa agacaaacca cacaaccc-c 
cttgagt-tg taacagccgc tgggattaca cacggcatgg a-gaact; 



540 



720 



atcgcgagca agggcgagga gctgttcacc cgcgcogtgc c-a— cac- r 3a -,~r„ 
ggcgacgtaa acggccacaa gtccagcg-.a ^ccgacLIg ! 5!!^ C "- ac 

ggcaagctga ccctgaagcc catcc^olcc aceg^clalc Sc~gcqcc c^a^" 
cccgtgacca ccttcggcta cggcctgaag tgcLcqccc ^ccacccc^ " " 
cagcacgact tctccaagtc cgccatgccc gaaggccacg ^cag^S J- 
"caaggacg acggcaacta caagacccgc gccgaggtgl aqcr.clag^ a"c Q 
gtgaaccgca Ccgagccgaa gggcaLcgac ttcaaagagg accgcLca- 9 
aagctggagt acaactacaa cagccacaac gtctata t = I -calfaqaac 

ggcatcaagg tgaaccccaa gatccgccac aacaccgagg ac^Igcq- -"I-rlorr 
gaccactacc agcagaacac ecccatcgqc qacg=ccc-q tqc^cfo^ = Ca9l - cc 9 cc 

SSS SS5SE S5== ™ ™ H™ - 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



:a caaa 



60 
120 
180 
240 
300 
360 
420 
480
540
600
660 
714 



<210> 17
<211> 714
<212> DNA
<213> Aequorea victoria

<400> 17
a tgag taaag 
gatgctaatg 
aaacc taccc 
gLcactactt 
ca tgact tt t 
aaagatgacg 
aacagaaccg 
t tggaat aca 
atcaaag tta 
cat ca tcaac 
ctgccccatc 
cc tgagt teg 



gagaagaacc 
ggcacaaatt 
t taaatt tat 
teggt tatgg 
tcaagagtgc 
ggaac tacaa 
age : aaaagg 
a c tacaa etc 
act tcaaaat 
aaaac act cc 
aatctgccct 
taacagctgc 



t ctcactgg- 
t tctgtcagt 
t :gcaccac: 
tct teaatge 
ca tgcccgaa 
gaeaegtget 
tamgat l 1 t 
aggcaat gt a 
tagocacaac 
aat tggegat 
cicyaaaga: 
tgggat t aca 



g z t g c cc caa 
ggagagggtg 
ggaaasctac 
t 1 1 gcaaga t 
gg-_-atgttc 
gasgtcaagc 
aaagaaga tg 
t acat ca tgg 
attgaagatg 
ggccctgt cc 
cccaacgaaa 
catggcatgg 



ttct tgt tga 
aaggtgatgc 
ccg: tccatg 
acccagatca 
aggaaagaac 
t tgaaggtga 
gaaacaL tc t 
cagacasaca 
gaagcgttca 
1 1 1 z accaca 
agayagacca 
a t gaactat a 



actaga tggt 
aacataegga 
gccaacact " 
tatgaaaegg 
t atatct etc 
t acccttgt t 
tggacacaaa 
aaagaatgga 
act agcagac 
caaccat tac 
cat gate ct t 
caaa 



<210> 18

<211> 720

<212> DNA 

<213> Aequorea victoria 



<400> 18 

atggcgagca 

gecgacg taa 

gecaagctga 

ctcgtgacca 

cagcacgact 

ttcaaggacg 

gtgaaccgea 

aagctggagt 

ggcaccaagg 

gaccactacc 

tacc tgagct 

ctgctggagc 



agggegagea 
acggccacaa 
ccccgaagt t 
cc tteggcta 
tc ttcaagtc 
aeggcaact a 
tcgagctgaa 
acaaccacaa 
tga3Cttcaa 
aqcagaacac 
accagcccgc 
tcgtgaccgc 



gctgt tcac c 
get cagegt = 
catctgcac z 
eggegtgea: 
cgccatgcc : 
caagacccg: 
gegea tcga: 
cageggcaa-; 
gat ccgccac 
cccca:cgg: 
cc tgagcaa : 
egcegggat \ 



gcggtggcgc 
t ceggegagg 
areggcaage 
tget tcgccc 
gaaggctacg 
gccgaggtga 
t tcaaggagg 
gtctatatca 
aacatcgagg 
gacggccccg 
gaccccaacg 
act c t eggea 



cc a t cc tgg t 
gegagggega 
tgcccgtgcc 
gc taccccga 
tecaggageg 
age -cgaggg 
aeggcaaca t 
Lggccgacaa 
aeggcageg t 
tgccgctgcc 
agaagegega 
tggacgagc t 



cgagctggac 
tgccacctac 
ccggcccacc 
ccacatgaag 
caccatcttc 
cgacaccctg 
cc tggggcac 
gcagaagaac 
gcagctcgcc 
cgacaaccac 
ccacatggtc 
gt acaagtaa 



<210> 19 

<211> 720

<212> DNA 

<213> Aequorea victoria 



60
120
180
240
300
360
420
480
540
600
660
714



60
120
180
240
300
360
420
480
540
600
660
720



<400> 19

atggcgagca 

ggcgacgtaa 

ggcaagctga 

ctcgcgacca 

cagcacgact 

ttcaaggacg 

gtgaaccgea 

aagccggagt 

ggcaccaagg 

gaccactacc 

tacccgagct 

ctgctggagt 



aggg c 9 a 99 a 

acggccacaa 
ccctgaagtt 
ccctcggcta 
tcttcaagtc 
aeggcaact a 
tcgagctgaa 
acaactacaa 
tgaact tcaa 
aqcagaacac 
accagtccgc 
ccgtgaccgc 



gctgtccacc 
gttcagcgt g 
catctgcac: 
cggcgtgcag 
cgccatgcc: 
caagacccgc 
gggca tcgac 
cagccagaac 
gat ccgcca: 
ccccatcggc 
cctgagcaaa 
egcegggat z 



ggggtggtgc 
t ceggegagg 
accggcaagc 
tgcttcgccc 
gaaggctacg 
gccgaggtga 
ttcaaggagg 
gtctatatca 
aacatcgagg 
gacggccccg 
gaccccaacg 
act ct eggea 



ccatcctggt 
gegagggega 
tgcccgtgcc 
gc taccccga 
tecaggageg 
ag t ccgaggg 
aeggcaaca t 
t cgccgacaa 
aeggcagegt 
tgctgctgcc 
ag a 3gcgcga 
t ggacgagct 



cgagctggac 
tgccacctac 
c tggcccacc 
ccacatgaag 
caccatcttc 
cgacaccctg 
cctggggcac 
acagaagaac 
gcagctcgcc 
cgacaaccac 
t cacatggtc 
eta caag t a a 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 




<210> 20 
<211> 5 
<212> PRT 

<213> Homo sapiens
<400> 20

Lys Lys Lys Arg Lys
1 5
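
The numeric identifiers in this listing follow the WIPO Standard ST.25 convention: <210> is the SEQ ID number, <211> the length, <212> the molecule type, <213> the organism, and <400> introduces the residues. As a minimal sketch, assuming an entry has already been cleaned to one identifier per line, the following Python checks that the residue count agrees with <211>. The helper names `parse_entry` and `length_matches` are hypothetical, and the sample text is SEQ ID NO: 20 from above.

```python
import re

# Three-letter amino acid codes accepted in a PRT entry (Xaa = any residue).
AMINO_ACIDS = {
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val", "Xaa",
}


def parse_entry(text):
    """Parse one cleaned ST.25-style entry into a dict (hypothetical helper)."""
    entry = {"fields": {}, "residues": []}
    in_sequence = False
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        match = re.match(r"<(\d{3})>\s*(.*)", line)
        if match:
            code, value = match.groups()
            entry["fields"][code] = value
            in_sequence = code == "400"
        elif in_sequence:
            # Keep residue tokens; position numbers such as "1 5" are skipped.
            entry["residues"] += [t for t in line.split() if t in AMINO_ACIDS]
    return entry


def length_matches(entry):
    """True when the residue count equals the declared <211> length."""
    return len(entry["residues"]) == int(entry["fields"]["211"])


SEQ_20 = """<210> 20
<211> 5
<212> PRT
<213> Homo sapiens
<400> 20
Lys Lys Lys Arg Lys
1 5
"""

entry = parse_entry(SEQ_20)
print(entry["fields"]["213"], len(entry["residues"]), length_matches(entry))
```

A check of this kind only flags entries whose declared and counted lengths disagree; it cannot say which of the two is wrong.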

<210> 21 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 21 

Met Leu Arg Thr Ser Ser Leu Phe Thr Arg Arg Val Gln Pro Ser Leu

1 5 10 15

Phe Arg Asn Ile Leu Arg Leu Gln Ser Thr
20 25 

<210> 22 
<211> 4
<212> PRT 

<213> Homo sapiens 

<400> 22 
Lys Asp Glu Leu 
1 

<210> 23 
<211> 4 
<212> PRT 

<213> Homo sapiens
<220>

<221> VARIANT

<222> (1)...(4)

<223> Xaa = Any Amino Acid

<400> 23 
Cys Ala Ala Xaa 
1 

<210> 24
<211> 4
<212> PRT

<213> Homo sapiens
<220>

<221> VARIANT

<222> (1)...(4)

<223> Xaa = Any Amino Acid

<400> 24
Cys Cys Xaa Xaa 
1 




Phe Met Asn Asn Trp Glu Val Tyr Lys Leu Leu Ala His Ile Arg Pro

1 5 10 15

Pro Ala Pro Lys Ser Gly Ser Tyr Thr Val
20 25

<210> 31 

<211> 24 

<212> PRT 

<213> Bos taurus

<400> 31

Ala Arg Lys Glu Val Ile Arg Asn Lys Ile Arg Ala Ile Gly Lys Met

1 5 10 15

Ala Arg Val Phe Ser Val Leu Arg
20 

<210> 32 
<211> 26 
<212> PRT 

<213> Oryctolagus cuniculus
<400> 32
Leu Arg Arg Leu Ile Asp Ala Tyr Ala Phe Arg Ile Tyr Gly His Trp

1 5 10 15

Val Lys Lys Gly Gln Gln Gln Asn Arg Gly
20 25

<210> 33 
<211> 27 
<212> PRT 

<213> Oryctolagus cuniculus
<400> 33
Arg Gly Lys Phe Lys Val Ile Cys Leu Thr Val Leu Ala Ser Val Arg

1 5 10 15

Ile Tyr Tyr Gln Tyr Arg Arg Val Lys Pro Gly
20 25

<210> 34
<211> 28
<212> PRT

<213> Homo sapiens
<400> 34
Leu Arg Arg Gly Gln Ile Leu Trp Phe Arg Gly Leu Asn Arg Ile Gln

1 5 10 15

Thr Gln Ile Lys Val Val Asn Ala Phe Ser Ser Ser
20 25

<210> 35

<211> 21

<212> PRT

<213> Bos taurus
<400> 35
Arg Arg Lys His Leu Gln Arg Pro Ile Phe Arg Leu Arg Cys Leu Val

1 5 10 15

Lys Gln Leu Glu Lys
20




<210> 36 

<211> 21

<212> PRT

<213> Bos taurus

<400> 36

Thr Glu Lys Met Trp Gln Arg Leu Lys Gly Ile Leu Arg Cys Leu Val

1 5 10 15

Lys Gln Leu Glu Lys

20

<210> 37 
<211> 23 
<212> PRT

<213> Rattus norvegicus
<400> 37

Lys Arg Arg Ala Ile Gly Phe Lys Lys Leu Ala Glu Ala Val Lys Phe

1 5 10 15

Ser Ala Lys Leu Met Gly Gln
20

<210> 38

<211> 28

<212> PRT

<213> Bos taurus

<400> 38

Ile Lys Pro Ala Lys Arg Met Lys Phe Lys Thr Val Cys Tyr Leu Leu

1 5 10 15

Val Gln Leu Met His Cys Arg Lys Met Phe Lys Ala

20 25



<210> 39
<211> 20
<212> PRT

<213> Bordetella pertussis
<400> 39

Ile Asp Leu Leu Trp Lys Ile Ala Arg Ala Gly Ala Arg Ser Ala Val

1 5 10 15

Gly Thr Glu Ala

20

<210> 40
<211> 27
<212> PRT

<213> Homo sapiens
<400> 40

Lys Ala His Lys Ala Ala Thr Lys Ile Gln Ala Ser Phe Arg Gly His

1 5 10 15

Ile Thr Arg Lys Lys Leu Lys Gly Glu Lys Lys
20 25

<210> 41 
<211> 24 
<212> PRT 

<213> Homo sapiens
<400> 41

Lys Thr Ala Ser Pro Trp Lys Ser Ala Arg Leu Met Val His Thr Val

Ala Thr Phe Asn Ser Ile Lys Glu
20

<210> 42 
<211> 25 
<212> PRT

<213> Rattus norvegicus
<400> 42
Lys Lys Lys Lys Lys Arg Phe Ser Phe Lys Lys Ser Phe Lys Leu Ser

1 5 10 15

Gly Phe Ser Phe Lys Lys Ser Lys Lys
20 25

<210> 43 
<211> 24 
<212> PRT 

<213> Mus musculus
<400> 43
Lys Lys Lys Lys Lys Phe Ser Phe Lys Lys Pro Phe Lys Leu Ser Gly

Leu Ser Phe Lys Arg Asn Arg Lys 
20 

<210> 44
<211> 31
<212> PRT

<213> Homo sapiens
<400> 44
Lys Gln Gln Lys Glu Lys Thr Arg Trp Leu Asn Thr Pro Asn Thr Tyr

Leu Arg Val Asn Val Ala Asp Glu Val Gln Arg Asn Met Gly Ser
20 25 30

<210> 45 
<211> 21 
<212> PRT 

<213> Mus musculus 

llVlsp^n Val Ala Asn Ser Ala Phe Gin Glu Arg Leu Arg Lys His 

x 5 10 

Gly Leu Glu Val Ile
20 

<210> 46 
<211> 21 
<212> PRT 

<213> Human immunodeficiency virus type 1 



;;: o ;il 6 A rg L eu ^ 9 ASP ^, ^, V e v .i Ly . ^ 9 ^ ^ 



1 




Leu Leu Gly Arg Arg
20 

<210> 47

<211> 23

<212> PRT

<213> Bos taurus

<400> 47

Gln Gln Leu Ala Thr Leu Ile Gln Lys Thr Tyr Arg Gly Trp Arg Cys

Arg Thr His Tyr Gln Leu Met
20

<210> 48 
<211> 24 
<212> PRT 

<213> Mus musculus 
<400> 48 

Arg Ala Ala Cys Ile Arg Ile Gln Lys Thr Ile Arg Gly Trp Leu Leu

Arg Lys Arg Tyr Leu Cys Met Gln
20 

<210> 49 
<211> 14 
<212> PRT 

<213> Vespa crabro
<400> 49

Ile Asn Leu Lys Ala Leu Ala Ala Leu Ala Lys Lys Ile Leu

<210> 50 
<211> 26 
<212> PRT 

<213> Apis mellifera 
<400> 50 

Gly Ile Gly Ala Val Leu Lys Val Leu Thr Thr Gly Leu Pro Ala Leu

1 5 10 15

Ile Ser Trp Ile Lys Arg Lys Arg Gln Gln

20 25 

<210> 51 

<211> 30 

<212> PRT

<213> Bos taurus

<400> 51

His Ser Gln Gly Thr Phe Thr Thr Ser Asp Tyr Ser Lys Tyr Leu Asp

1 5 10 15

Ser Arg Arg Ala Gln Asp Phe Val Gln Trp Leu Met Asn Thr
20 25 30

<210> 52 

<211> 27 

<212> PRT 

<213> Sus scrofa

<400> 52

His Ser Asp Gly Thr Phe Thr Ser Glu Leu Ser Arg Leu Arg Asp Ser

1 5 10 15

Ala Arg Leu Gln Arg Leu Leu Gln Gly Leu Val
20 25



<210> 53 

<211> 28 

<212> PRT 

<213> Homo sapiens

<400> 53

His Ser Asp Ala Val Phe Thr Asp Asn Tyr Thr Arg Leu Arg Lys Gln

1 5 10 15

Met Ala Val Lys Lys Tyr Leu Asn Ser Ile Leu Asn

20 25

<210> 54 

<211> 33 

<212> PRT 

<213> Sus scrofa

<400> 54

Tyr Ala Asp Gly Thr Phe Ile Ser Asp Tyr Ser Ala Ile Met Asn Lys

1 5 10 15

Ile Arg Gln Gln Asp Phe Val Asn Trp Leu Leu Ala Gln Gln Gln Lys

20 25 30

Ser



<210> 55

<211> 17

<212> PRT

<213> Artificial Sequence

<220>

<223> Calmodulin binding peptide-

<400> 55

Lys Leu Trp Lys Lys Leu Leu Lys Leu Leu Lys Lys Leu Leu Lys Leu

1 5 10 15

Gly



<210> 56 
<211> 5
<212> PRT

<213> Artificial Sequence
<220>
<223> Linker moiety; sequence repeated indefinitely
<400> 56

Gly Gly Gly Gly Ser 
1 5 
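
The Gly Gly Gly Gly Ser unit of SEQ ID NO: 56 is described as a repeated linker. A minimal sketch of how such a linker could be written out for a chosen repeat count is given below. The repeat count of 3 and the helper names are assumptions made for illustration; they are not values taken from this listing.

```python
# One-letter codes for the two residues that occur in the linker unit.
THREE_TO_ONE = {"Gly": "G", "Ser": "S"}

LINKER_UNIT = ["Gly", "Gly", "Gly", "Gly", "Ser"]  # unit from SEQ ID NO: 56


def repeat_linker(unit, n):
    """Return the unit repeated n times as a list of three-letter codes."""
    if n < 1:
        raise ValueError("repeat count must be at least 1")
    return unit * n


def to_one_letter(residues):
    """Convert a list of three-letter codes to a one-letter string."""
    return "".join(THREE_TO_ONE[r] for r in residues)


linker = repeat_linker(LINKER_UNIT, 3)  # n = 3 is an arbitrary example value
print(to_one_letter(linker))  # GGGGSGGGGSGGGGS
```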

<210> 57
<211> 12
<212> PRT

<213> Artificial Sequence
<220>

<223> Linker moiety
<400> 57

Gly Lys Ser Ser Gly Ser Gly Ser Glu Ser Lys Ser
1 5 10



<210> 58
<211> 14
<212> PRT

<213> Artificial Sequence
<220>

<223> Linker moiety

<400> 58

Gly Ser Thr Ser Gly Ser Gly Lys Ser Ser Glu Gly Lys
1 5 10

<210> 59 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Linker moiety 



<400> 59 



Gly Ser Thr Ser Gly Ser Gly Lys Ser Ser Glu Gly Ser Gly Ser Thr
1 5 10 15

Lys Gly



<210> 60 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Linker moiety
<400> 60

Gly Ser Thr Ser Gly Ser Gly Lys Ser Ser Glu Gly Lys Gly

<210> 61 
<211> 18
<212> PRT

<213> Artificial Sequence
<220>

<223> Linker moiety

<400> 61

Gly Ser Thr Ser Gly Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser Thr
Lys Gly






<210> 62 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Linker moiety
<400> 62

Gly Lys Ser Ser Gly Ser Gly Ser Glu Ser Lys Glu Phe

1 5 10

<210> 63 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Linker moiety
<400> 63

Gly Gly Thr Gly Gly Ser 
1 5 



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parameter, and a fluorescence protein functional group (III), is new. 
(II) is operatively inserted into (III), and the fluorescence of (III) 
is affected by the responsiveness of (II). 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: 

(1) an isolated nucleic acid sequence (IV) which encodes a FI which 
comprises a (II) and (III); 

(2) an expression vector (V) containing (IV); 

(3) a transgenic non-human animal comprising (IV); 

(4) an expression vector (VI) comprising expression control 
sequences operatively linked to (IV); 

(5) a host cell (VII) transfected with (VI); 

(6) FI which comprises (II) and (III); 

(7) an isolated nucleic acid sequence (VIII) encoding a circularly 
permuted fluorescent protein moiety (IX) which comprises a linker group 
(LM) linking the amino-terminal and carboxy-terminal amino acids of a 
fluorescent protein, in which the amino and carboxy termini are linked 
as internal amino acids in the (IX) and two terminal ends (TTE) in 
which the first end is an amino-terminal end and the second end is a 
carboxy terminal end and in which the amino and carboxy terminal ends 
of (IX) are different from the amino-terminal and carboxy-terminal 
amino acids of the fluorescent protein; 

(8) an expression vector and transgenic non-human animal comprising
(VIII);

(9) an expression vector (X) comprising expression control 
sequences operatively linked to (VIII); 

(10) a host cell (XI) transfected with (X); 

(11) an isolated polypeptide comprising (IX) which comprises LM, 
TTE and also (II); 

(12) producing (VIII) involves linking a nucleic acid sequence encoding LM to the
5' nucleotide of a polynucleotide encoding a fluorescent protein,
circularizing the polynucleotide with the nucleic acid sequence 
encoding the linker sequence and cleaving the circularized 
polynucleotide with a nuclease, hv wMnh rlpavpo-p t^p ri mil ^r>i <*nA 
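
The circular permutation described in (7) and (12) can be illustrated on a plain sequence string: the original termini are joined through a linker, and the chain is reopened at a new position so that the old termini become internal. This is a sketch on strings only. The function name, the toy sequence and the cut position are hypothetical and do not correspond to any construct in this document.

```python
def circularly_permute(sequence, linker, new_start):
    """Join the original C- and N-termini through `linker`, then reopen the
    circle so that residue `new_start` (0-based) becomes the new N-terminus."""
    if not 0 < new_start < len(sequence):
        raise ValueError("new_start must fall strictly inside the sequence")
    # New chain: residues from new_start to the old C-terminus, the linker,
    # then residues from the old N-terminus up to (not including) new_start.
    return sequence[new_start:] + linker + sequence[:new_start]


toy = "ABCDEFGH"          # stand-in for a protein sequence
permuted = circularly_permute(toy, "gg", 5)
print(permuted)  # FGHggABCDE
```

The permuted chain has the same residues as the original plus the linker; only the positions of the termini change.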



is new. It comprises: (a) a first chemical entity binding region (CEBR)
that binds to a chemical entity (CE) and to a second protein partner;
and (b) a FP in frame with the first CEBR; where the FP is a first ETP
for a second ETP attached to the second protein partner.

Also claimed are:

(1) a system for monitoring protein-protein energy transfer
partner, comprising (I);

(2) an expression vector containing a nucleic acid as in (I);

(3) a transgenic non-human animal comprising a nucleic acid
sequence as in (I);

(4) an expression vector comprising expression control sequences
operatively linked to a nucleic acid sequence coding for the expression
of a fluorescent indicator, the indicator comprising, (a) a binding
protein moiety (BPM) having an analyte-binding region which binds an
analyte and causes the indicator to change conformation upon exposure
to the analyte, (b) a donor FP moiety fused to the BPM, and (c) an
acceptor FP moiety fused to the BPM, where the donor moiety and the
acceptor moiety change position relative to each other when the analyte
binds to the analyte-binding region, altering fluorescence resonance
energy transfer (FRET) between the donor moiety and the acceptor moiety
when the donor moiety is excited;

(5) a host cell transfected with an expression vector as in (4); 

(6) a transgenic non-human animal having a phenotype characterised
by expression of a nucleic acid sequence as in (I);

(7) an isolated nucleic acid sequence which encodes a fluorescent
indicator as in (4);

(8) an expression vector containing a nucleic acid sequence as in
(7);

(9) a transgenic non-human animal comprising a nucleic acid
sequence as in (7);

(10) an expression vector comprising expression control sequences
operatively linked to a nucleic acid sequence coding for the expression
of a fluorescent indicator as in (7);

(11) a host cell transfected with an expression vector comprising
an expression control sequence operatively linked to a sequence coding
for the expression of a fluorescent indicator as in (7);

(12) a transgenic non-human animal having a phenotype characterised
by expression of a nucleic acid sequence as in (7), the phenotype being
conferred by a transgene contained in the somatic and germ cells of the
mouse, the transgene comprising a nucleic acid sequence which encodes a
fluorescent indicator specific antigen polypeptide;

(13) a fluorescent indicator comprising, (a) a BPM having an
analyte-binding region which binds an analyte and causes the indicator
to change conformation upon exposure to the analyte, (b) a donor
moiety covalently coupled to the BPM, and (c) an acceptor FP moiety
covalently coupled to the BPM; where the donor moiety and the acceptor
moiety change position relative to each other when the analyte binds to
the analyte-binding region, altering FRET between the donor moiety and
the acceptor moiety when the donor moiety is excited; and

(14) a fluorescent indicator comprising, (a) a target peptide
moiety, (b) a BPM having an analyte-binding region which binds an
analyte and causes the indicator to change conformation upon exposure
to the analyte, and a peptide-binding region for binding the target
peptide moiety, (c) a linker moiety that covalently couples the binding
protein and the target peptide moiety, (d) a donor FP moiety covalently
coupled to the BPM, and (e) an acceptor FP moiety covalently coupled to
the target peptide moiety; where the donor moiety and the acceptor
moiety change position relative to each other when the analyte binds to
the analyte-binding region, altering FRET between the donor moiety and
the acceptor moiety when the donor moiety is excited, where the
indicator is a single polypeptide.
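
The indicators in (4), (13) and (14) report analyte binding through a change in donor-acceptor distance. As a rough illustration, the standard Förster relation E = 1 / (1 + (r/R0)^6) shows how strongly the transfer efficiency depends on that distance. The Förster radius and the two distances below are placeholder numbers chosen for the example, not measurements from this document, and the function name is hypothetical.

```python
def fret_efficiency(r_nm, r0_nm):
    """Förster relation: efficiency of energy transfer at distance r_nm for a
    donor-acceptor pair with Förster radius r0_nm (both in the same units)."""
    if r_nm <= 0 or r0_nm <= 0:
        raise ValueError("distances must be positive")
    return 1.0 / (1.0 + (r_nm / r0_nm) ** 6)


R0 = 5.0        # assumed Förster radius in nm (placeholder value)
EXTENDED = 7.0  # assumed donor-acceptor distance without analyte (placeholder)
COMPACT = 4.0   # assumed distance after the conformational change (placeholder)

print(fret_efficiency(R0, R0))  # 0.5, by definition of the Förster radius
print(fret_efficiency(COMPACT, R0) > fret_efficiency(EXTENDED, R0))  # True
```

Because of the sixth-power dependence, a modest change in distance around R0 produces a large change in efficiency, which is why a conformational change on analyte binding can give a readily measurable signal.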

USE - The fluorescent indicators can be used to determine the 
concentration of an analyte both in vitro and in vivo, but without 
being invasive to the organism under test. The binding of an analyte 
moiety causes a change in a protein expressed by the transgenic 
animals, thereby causing a readily detectable result. 
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